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Schistosomiasis is a major neglected tropical disease caused by trematodes from the genus Schistosoma. Because 
schistosomes exhibit a complex life cycle and numerous mechanisms for regulating gene expression, it is believed 
that spliced leader (SL) trans-splicing could play an important role in the biology of these parasites. The purpose 
of this study was to investigate the function of trans-splicing in Schistosoma mansoni through analysis of genes that 
may be regulated by this mechanism and via silencing SL-containing transcripts through RNA interference. Here, we 
report our analysis ofSL transcript-enriched cDNA libraries from different S. mansoni life stages. Our results show 
that the trans-splicing mechanism is apparently not associated with specific genes, subcellular localisations or life 
stages. In cross-species comparisons, even though the sets of genes that are subject to SL trans-splicing regulation 
appear to differ between organisms, several commonly shared orthologues were observed. Knockdown of trans- 
spliced transcripts in sporocysts resulted in a systemic reduction of the expression levels of all tested trans-spliced 
transcripts; however, the only phenotypic effect observed was diminished larval size. Further studies involving the 
findings from this work will provide new insights into the role of trans- splicing in the biology ofS. mansoni and 
other organisms. All Expressed Sequence Tags generated in this study were submitted to dbEST as five different 
libraries. The accessions for each library and for the individual sequences are as follows: (i) adult worms of mixed 
sexes (LIBEST_027999: JZ139310 - JZ139779), (ii) female adult worms (LIBEST_028000: JZ139780 - JZ140379), (Hi) 
male adult worms (LIBEST_028001: JZ140380 - JZ141002), (iv) eggs (LIBEST_028002: JZ141003 - JZ141497) and (v) 
schistosomula (LIBEST_028003: JZ141498 - JZ141974). 
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Schistosomiasis is an important neglected tropi- 
cal disease caused by species of the parasitic flatworm 
Schistosoma. According to the World Health Organiza- 
tion (WHO 2012), the disease affects more than 230 mil- 
lion people yearly and the resulting morbidity compro- 
mises local economies and child development (Fenwick 
& Webster 2006). Transmission of schistosomiasis has 
been documented in 77 countries, with more than 90% 
of cases occurring on the African continent. Schistosoma 
mansoni, which is mainly found in Africa and Brazil, is 
a major cause of intestinal schistosomiasis. This parasite 
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exhibits a complex life cycle involving a snail interme- 
diate host and a mammalian definitive host (Pessoa & 
Martins 1982). 

Gene discovery in S. mansoni has taken advantage of 
extensive and well-annotated Expressed Sequence Tags 
(EST) databases (dbEST, SchistoDB and GeneDB) and, 
more recently, the Sequence Read Archive, containing 
next-generation sequencing reads covering the entire 
transcriptome of this species (Boguski et al. 1993, Zerlo- 
tini et al. 2009, Leinonen et al. 2011, Logan-Klumpler et 
al. 2012). Additionally, reverse genetic approaches have 
been widely explored in this parasite in order to provide 
perspectives on the identification of new targets for drug 
and vaccine development and to develop novel protocols 
for diagnosis (Skelly et al. 2003, Kalinna & Brindley 
2007, Mann et al. 2008, Pearce & Freitas 2008, Mourao 
et al. 2009a, b, Yoshino et al. 2010). 

Schistosomes possess numerous and complex tran- 
scriptional and post-transcriptional gene regulatory 
mechanisms allowing them to maintain their complex 
life cycle. Because of the prominence of the spliced 
leader (SL) sequence in a number of 5. mansoni messen- 
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ger RNA (mRNAs), it is presumed that SL trans-splicing 
represents an important form of post-transcriptional reg- 
ulation and could be a potential target for impairing S. 
mansoni development (Davis et al. 1995). All organisms 
that exhibit trans-splicing display one or more SL-RNAs, 
which are products of tandemly repeated small intron- 
less genes transcribed by DNA polymerase II (Hastings 
2005). SL-RNAs are small non-coding RNAs of 40-140 
nucleotides in length, carrying a donor splice site and a 
hyper-modified cap (Nilsen 1993). The donor splice site 
divides the SL-RNA into two segments: a 5' leader se- 
quence and an intron-like sequence at the 3' end. Despite 
a lack of sequence similarity, SL-RNAs from different 
organisms exhibit an impressive similarity in secondary 
structure to small nuclear RNAs, which are components 
of the spliceosome and actively participate in all splicing 
mechanisms (Hastings 2005). 

The function of trans-splicing is still poorly under- 
stood. Although the best-documented function of SL 
trans-splicing is in the generation of monocistronic tran- 
scripts from polycistronic operons (Blumenthal & Glea- 
son 2003), trans-splicing has also been implicated in a 
variety of functions associated with RNA maturation, 
including (i) providing a 5' cap for RNAs transcribed by 
RNA polymerase I (Lee & Van der Ploeg 1997, Gunzl et 
al. 2003), (ii) enhancing translation through the addition 
of a hyper-modified 5' cap in immature mRNAs and (iii) 
removing potentially deleterious elements within the 5' 
UTR (e.g., sequences that could compromise mRNA 
translation) (Hastings 2005, Matsumoto et al. 2010). 

Thus far, SL trans-splicing mechanisms have been 
identified in cnidarians, primitive chordates, nematodes, 
platyhelminthes and dinoflagellates (Krause & Hirsh 
1987, Rajkovic et al. 1990, Brehm et al. 2000, 2002, Sto- 
ver & Steele 2001, Vandenberghe et al. 2001, Zayas et 
al. 2005, Lidie & van Dolah 2007). In contrast, trans- 
splicing has never been described in plants, vertebrates 
or fungi, which raises many questions regarding the oc- 
currence of SL trans-splicing in an evolutionary context 
and its role in post-transcriptional regulation in selected 
species. In S. mansoni, SL trans-splicing does not ap- 
pear to be associated with any particular tissue, develop- 
mental phase or sex (Davis 1996). Moreover, there is no 
conclusive evidence associating SL trans-splicing with 
specific genes or gene families (Davis et al. 1995). The 
main goal of the present study was to identify genes or 
gene categories that could be targeted by trans-splicing 
in different schistosome life cycle stages and to provide 
a better understanding of the importance of the trans- 
splicing mechanism during S. mansoni development. 

MATERIALS AND METHODS 

Biological samples - The S. mansoni life cycle 
was maintained at the Rene Rachou Research Centre 
(CPqRR), Oswaldo Cruz Foundation, and at Interde- 
partmental Group for Epidemiological Research, De- 
partment of Parasitology, Federal University of Minas 
Gerais (UFMG), Brazil. The LE strain of 5. mansoni was 
maintained in the snail intermediate host Biomphalaria 
glahrata (Barreiro de Cima strain). Outbred Swiss Web- 
ster mice were housed conventionally in polypropylene 



cages with stainless steel screen covers. All animals re- 
ceived laboratory mouse chow and water ad libitum. The 
experimental protocols described herein were reviewed 
and approved by the Ethical Review Committee for Ani- 
mal Experimentation (CETEA) of UFMG (185/2006). 

Adult worms were obtained via portal perfusion of 
mice that had been infected for five weeks as previously 
described (Smithers & Terry 1965). The worms were 
washed with cold saline solution, carefully separated 
based on sex with fine forceps under a microscope and 
immediately frozen at -80°C until further processing. 
Mechanically transformed schistosomula (7-day-cul- 
tured) (Basch 1981) were provided by CPqRR. S. manso- 
ni eggs were recovered from the intestinal homogenates 
of 48-day-infected Swiss Webster mice. The collected 
tissues and eggs were filtered through a sieve to remove 
coarse debris and then allowed to settle. The resulting 
pellet was washed with 1.7% saline and frozen at -80°C 
for further processing. 

RNA isolation, reverse transcriptase (RT) and SL 
transcript enrichment - RNA from male and female adult 
worms and cultured schistosomula was extracted using 
the RNAgents kit (Promega, Madison, USA) following 
the manufacturer's protocol. Total RNA from S. man- 
soni eggs was extracted using the TRIzol Reagent (Life 
Technologies, Carlsbad, CA, USA) according to manu- 
facturer's protocol. Direct isolation of poly(A)+ mRNA 
from adult worms was performed using Dynabeads 
Oligo (dT)25 magnetic beads (Dynal, Life Technolo- 
gies, Carlsbad, USA). Briefly, following the extraction 
step, beads containing bound mRNA were re-suspended 
in 20 (xL of IX RT buffer (Life Technologies, Carlsbad, 
USA) and used directly in RT-polymerase chain reaction 
(PCR) assays. 

The cDNA synthesis was carried out from all sam- 
ples, except for those from adult worms, using Super- 
Script II Reverse Transcriptase (Life Technologies, 
Carlsbad, USA), according to procedures outlined by the 
manufacturer. For adult worms, 1 |.iL of beads contain- 
ing mRNA was added directly to the reaction. The oligo 
dT-anchored primer used for cDNA first-strand synthe- 
sis (5'CGGTATTTCAGTCGGTGTTCAAACCT19V3' - 
V = A, G, C) was designed to contain a 5' tail that was 
later employed in a PCR assay to amplify trans-spliced 
transcripts. The strategy for enriching cDNA libraries in 
trans-spliced transcripts included a PCR step involving 
a 5' tailed oligo dT sequence (Brehm et al. 2000) and 
part of the previously described S. mansoni SL sequence 
(underlined), 5'AACCGTCACGGTTTTACTCTTGTGA 
TTTGTTGCATG3' (Davis et al. 1995). 

To prevent the amplification of spurious cDNA se- 
quences, a step-down program was employed for PCR, 
consisting of one cycle of 5 min at 95°C for DNA dena- 
turation, five cycles of 1 min at 95°C, 1 min at 60°C and 
1.5 min at 72°C, five cycles of 1 min at 95°C, 1 min at 
59°C and 1.5 min at 72°C, five cycles of 1 min at 95°C, 1 
min at 58°C and 1.5 min at 72°C and 19 cycles of 1 min 
at 95°C, 1 min at 57°C and 1.5 min at 72°C. 

Size selection of cDNAs - Fragment size selection was 
performed to prevent over-representation of small PCR 
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products. To accomplish this, two methodologies were 
applied: (i) PCR amplicon separation via electrophoresis 
in a 1% agarose gel and further isolation using the en- 
zyme P-agarase according to Franco et al. (1995) and (ii) 
precipitation with 15% polyethyleneglycol 8000 (to ob- 
tain fragments > 400 bp). The purified amplicons were 
cloned into the pGEM and pCR2.1 vectors using the T- 
easy System Vector kit (Promega, Madison, USA) and a 
TA cloning kit (Life Technologies, Carlsbad, USA), re- 
spectively, according to manufacturer's specifications. 

The recombinant plasmids were used to transform 
competent Escherichia coli of the DH5a strain. To select 
clones with large inserts and test library quality, recom- 
binant bacterial clones were subjected to colony PCR us- 
ing M13 forward and reverse primers. Amplification and 
insert size estimates were confirmed via electrophoresis 
in a 1% agarose gel. Selected clones were grown over- 
night in 2X YT medium (16 g of bacto tryptone, 10 g 
of bacto yeast extract, 5 g of NaCl per litre, pH 7.0) and 
recombinant plasmids were purified using a standard 
protocol. Template preparation and DNA sequencing re- 
actions were conducted through DYEnamic ET dye ter- 
minator cycle sequencing (GE Healthcare), following the 
manufacturer's protocol with a MegaBACE 1000 capil- 
lary sequencer (GE Healthcare). 

S. mansoni in vitro culture and small interfering (siR- 
NA) treatment - All RNA interference (RNAi) experi- 
ments were performed using the Naval Medical Research 
Institute strain of S. mansoni. Eggs were obtained from 
the livers of mice that had been infected for seven-eight 
weeks. Transformation was carried out as previously de- 
scribed (Yoshino & Laursen 1995). Larvae were counted 
and distributed into either 48 or 96-well polystyrene tis- 
sue culture plates (Costar, Corning Incorporated, NY, 
USA) at concentrations of -6,000 [RT-quantitative PCR 
(qPCR)] or -500 miracidia/well (Mourao et al. 2009a). All 
RNAi experiments involved at least two technical repli- 
cates of the miracidial treatment and control groups and 
were repeated in three independent larval cultures. The 
parasites in culture were exposed on day zero to SL se- 
quence siRNAs (treatment), an irrelevant siRNA (control 
I; decoy) or medium alone (control II). Cultured larvae 
were assessed for knockdown effects after seven days of 
treatment (Mourao et al. 2009a). The RNAi experiments 
involving mice were pre-approved by the Institutional 
Animal Care and Use Committee of the University of 
Wisconsin-Madison, where the experiments were con- 
ducted, under assurance A3368-01. 

The double-stranded siRNA sequences of 21 nucle- 
otides in length were designed using BLOCK-iT^M 
RNAi Designer tools, available from rnaidesigner.in- 
vitrogen.com/rnaiexpress (Life Technologies, Carlsbad, 
CA, USA). The generated sequences were synthesised 
by Life Technologies using the StealthTM proprietary 
modification. The decoy control was designed with a 
similar GC content and length as the target SL-siRNA. 

Phenotypic screening - Cultured S. mansoni larvae 
were plated in 96-well culture plates (Costar) at a den- 
sity of approximately 500 miracidia per well, which con- 
tained a 200 pM concentration of SL-siRNA (experiment 



group) or decoy sequence siRNA (control I) diluted in 
200 |.iL of CBSS or medium lacking siRNA (control II). 
The cultures were maintained at 26°C for seven days, af- 
ter which the sporocysts were monitored for the follow- 
ing phenotypes: failure or delay of transformation, loss 
of motility, tegumental lysis and granulation (lethality) 
and changes in larval growth. Parasite viability and mor- 
phological changes were monitored daily as previously 
described (Mourao et al. 2009a). Length measurements 
were performed in captured images using Metamorph 
software, version 7.0 (Meta Imaging series. Molecular 
Devices, Sunnyvale, CA, USA). Larval growth datasets 
for each experimental replicate were statistically analy- 
sed using the Mann-Whitney U test (Wilcoxon-Sum of 
Ranks test) at a significance level of p < 0.05. All treat- 
ments were performed in triplicate wells and were in- 
dependently replicated three times in miracidia isolated 
from different batches of infected mouse livers. 

Effect of double-stranded RNA treatment on larval 
gene expression - qPCR was used to determine steady- 
state transcript levels in specific ds-siRNA-treated 
sporocysts. In these experiments -6,000 miracidia were 
distributed into a 48 -well plate (Costar) and treated with 
200 nM siRNA diluted in CBSS (500 ^iL/well). The cul- 
tures were maintained at 26°C for seven days prior to 
RNA extraction and isolation. 

Following incubation, the sporocysts were extensively 
washed with CBSS to eliminate unabsorbed siRNAs and 
shed ciliary epidermal plates, followed by extraction in 
TRIzol Reagent (Life Technologies, Carlsbad, CA, USA) 
to isolate total RNA from cultured larvae (Mourao et al. 
2009a). Isolated RNA was resuspended in diethylpyro- 
carbonate-treated water and subjected to DNAse treat- 
ment using the Turbo DNA-Free kit (Ambion, Austin, 
TX, USA) to eliminate contaminating genomic DNA. 
The RNA samples were then quantified and their purity 
assessed in a Nanodrop Spectrometer, ND-1000 (Nano- 
Drop Technologies, Inc, Wilmington, DE, USA). 

RT-qPCR analysis - To evaluate transcript levels 
between the SL-siRNA-treated sporocysts and control 
treatment (decoy-siRNA), we performed a qPCR analy- 
sis. To this end, 0.5 |j,g of total RNA derived from at 
least three different extractions was employed to syn- 
thesise cDNA using the Superscript III cDNA Synthesis 
kit (Invitrogen) following the manufacturer's protocol. 
The RT-qPCR assay mixtures consisted of 2.5 \xL of 
cDNA, 12.5 ^L of SYBR Green PCR Master Mix (Ap- 
plied Biosystems, Foster City, CA, USA) and 10 (xL of 
600 or 900 nM primers, determined after primer con- 
centration optimisation following the Minimum Infor- 
mation for Publication of Quantitative Real-Time PCR 
Experiments guidelines (Bustin et al. 2009), in 96-Well 
Optical Reaction Plates (ABI PRISM, Applied Biosys- 
tems, Foster City, CA, USA). The reactions were carried 
out using the AB7500 Real-Time PCR System (Applied 
Biosystems, Foster City CA, USA). RT-qPCR valida- 
tions were performed with the SL forward primer 5'- 
GTCACGGTTTTACTCTTGT-3' and a gene-specific 
reverse primer. Specific primers were designed for (i) 
five previously reported trans-spliced transcripts (Da- 
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vis et al. 1995), (ii) S. mansoni a-tubulin, used as an 
endogenous normalisation control in all tested samples 
and (iii) the S. mansoni genes encoding three known 
non-trans-spliced transcripts, which were used as nega- 
tive controls (Supplementary data). Each RT-qPCR run 
was conducted with two internal controls for assessing 
potential genomic DNA contamination (no RT) and the 
purity of the reagents used (no cDNA). For each specific 
set of primers, all individual treatments (including spec- 
ificity controls) were run in three technical replicates. 
Each experiment was repeated three times as indepen- 
dent biological replicates and the results were analysed 
via the AACt method (Livak & Schmittgen 2001). Due to 
the nonparametric distribution of data, statistical analy- 
sis of the AACt values was carried out using the Mann- 
Whitney U test, with significance set at p <0.05. 

Bioinformatics analysis - The output files gener- 
ated from the sequencing reactions (SL-enriched EST 
library) and from publicly available S. mansoni EST 
data (dbEST) (Boguski et al. 1993) were submitted 
to a bioinformatics pipeline including algorithms for 
base-calling, poly-A and vector decontamination, motif 
searching, similarity-based characterisation, gene ontol- 
ogy (GO) assignment and manually curated annotation 
and analysis (Fig. 1). All sequence retrieval (except when 
otherwise stated) was performed within the SchistoDB 
database. Information on exon content was also obtained 
from this database. 

Sequence processing - Phred (Ewing et al. 1998) was 
employed as a base-calling procedure. Only sequences 
of at least 150 bases with quality scores higher than 10 
were accepted. A multi-Fasta file was generated with 
all resulting sequences and was used throughout the 
subsequent analysis. Following Phred processing, the 
sequences from the SL-enriched EST library were sub- 
jected to analysis with the SeqClean program (available 
from: compbio.dfci.harvard.edu/tgi/software/), which 
consists of a Tiger-developed script capable of analysing 
EST data. SeqClean was employed to trim ESTs based on 
informational content, length and vector contamination. 
We used the vector database UniVec (Kitts et al. 2011) 
with two additional vectors (pGEM and pCR2.1) to scan 
our sequences for vector contamination with SeqClean. 
Poly-A tails and vector adaptors were also removed in 
this step. Only ESTs longer than 50 bp (following vec- 
tor cleaning) were further evaluated. S. mansoni dbEST 
data were further submitted to the SeqClean cleaning 
pipeline and analysed together with the SL-enriched 
EST library data. 

SL detection and cleaning - To identify SL-contain- 
ing sequences in both the SL-enriched EST library and 
dbEST, we used BLAST algorithms. To classify ESTs 
as SL-containing ESTs, we defined an e-value cut-off 
of 5 X 10"' and considered only sequences exhibiting at 
least 95% similarity and 25 contiguous nucleotides when 
aligned to the known SL sequence. After identifying 
the sequences as SL-containing ESTs, we used the local 
alignment software cross match (Ewing et al. 1998) to 
remove the SL region from all SL-containing sequences. 



Sequence clustering and mapping - Both the SL-en- 
riched EST library and S. mansoni dbEST were analy- 
sed using the cap3 program (Huang & Madan 1999) to 
generate sequence contigs. We employed cap3 with the 
default parameters, except for the -o and -p flags, which 
were set to 40 and 95, respectively. Therefore, two ESTs 
were only grouped into a single contig when they shared 
at least 40 aligned nucleotides with a minimum sequence 
identity of 95%. After identifying the SL-containing 
sequences within the SL-enriched EST library and as- 
sembling the corresponding contigs, we mapped the 
sequences in the S. mansoni genome. For this purpose, 
we used BLAST to search for the previously assembled 
unique sequences in the parasite genome. The defined 
cut-off for the e-value was 1 x 10"'° and the minimum 
sequence similarity to be accepted was 90%). 

GO assignment and annotation - We used SchistoDB 
(Zerlotini et al. 2009) to automatically annotate unique 
sequences from the SL-enriched EST library. GO as- 
signment was performed for the set of sequences gener- 
ated from the previous steps. GO categories were associ- 
ated with each transcript using GoAnna (McCarthy et al. 
2006). GO Slim terms (McCarthy et al. 2006) were also 
retrieved for all sequences to obtain a more general over- 
view of the GO among the dataset. Manual functional 
annotation was conducted for all known proteins (ex- 
cluded sequences were characterised as "expressed pro- 
teins" and "hypothetical proteins"). A literature search 
and homology analysis were carried out to assure correct 
functional annotation of the gene products. All proteins 
were clustered according to biological processes catego- 
ries to provide a better understanding of the trans-splic- 
ing function in the cell. 

RESULTS 

Dataset generation and data analysis - To determine 
whether the trans-splicing mechanism could target spe- 
cific functional gene categories in S. mansoni, we first 
generated an SL-enriched EST dataset containing subsets 
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Fig. 1: fluxogram illustrating each step of processing and curation 
performed with sequences from the dbEST and the spliced leader 
(SL)-enriched Expressed Sequence Tags library. The fluxogram con- 
tains both the names of all programs and methods (blue shaded boxes) 
used in this study for sequence cleaning, validation, trimming and 
annotation and the results generated by those (pink shaded boxes). 
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from diverse parasite life cycle stages. The enriched EST 
dataset yielded a total of 3,087 sequences, 481 of which 
were from schistosomula, while 502 were from eggs, 600 
from females, 623 from males and 881 from adult worms 
of mixed sex. After the removal of spurious sequences 
and vector contamination using SeqClean, 2,781 valid se- 
quences were retained in the dataset. Of these sequences, 
1,665 were classified as SL-containing sequences accord- 
ing to the previously described criteria. When the ratio of 
valid ESTs in the SL-enriched dataset containing the SL 
sequence vs. the total number of ESTs was compared, we 
found that 59.8% of the ESTs carried the SL sequence 
according to our very strict stringency parameters (as re- 
ported in the Materials and Methods section). This high 
percentage of SL sequences confirmed the enrichment 
of our SL dataset, as the SL/non-SL transcript ratio was 
only 0.1% when the entire dbEST dataset was analysed 
using the same protocol (Table I). Furthermore, recent 
data generated through RNA-Seq analysis (Protasio et 
al. 2012) indicate that -11% (1,178 SL transcripts) of all 
S. mansoni transcripts are processed by trans-splicing. 
Taken together, these data suggest a high enrichment of 
SL-containing sequences in our dataset. 

All 1,665 SL-containing sequences were subjected 
to BLAST searches in the SchistoDB database to map 
them into the S. mansoni genome and assign annota- 
tions and ID numbers according to SchistoDB data. 
Nine hundred eighty-nine sequences were mapped and 
further clusterised by cap3, resulting in 258 unique se- 
quences (102 singlets and 156 contigs). The remaining 
676 sequences corresponded to redundant sequences or 
to sequences that did not map to the S. mansoni genome. 
The number of unique sequences per life stage is listed 
in Table I. These 258 unique sequences were mapped 
to a set of 162 different protein-coding sequences from 
the parasite genome. Sixty-four unique sequences were 
classified as "conserved hypothetical proteins" (7), 
"hypothetical proteins" (10) or "expressed proteins" 
(47) and were therefore not included in the functional 
analysis. A final set of 99 unique sequences was defined 
and was employed in all subsequent analyses (Fig. 2, 
Supplementary data). 

GO and manual functional annotation - To identify 
the biological processes, subcellular localisations and 
molecular functions associated with the trans-spliced 
transcripts, GO terms were assigned to 78 of the 99 
protein-coding unique sequences (the remaining 21 
were not associated with any GO term by GOAnna). Of 
the 78 proteins assessed, only 30 were assigned a par- 
ticular subcellular localisation based on GOSlim results. 
Among these proteins, eight were localised to the mem- 
brane, eight were cytoplasmic (including 2 cytoskele- 
ton-associated proteins), five were nuclear proteins, five 
were mitochondrial proteins and another five were clas- 
sified merely as intracellular proteins, with no specific 
localisation. Based on analysis of the molecular func- 
tions assigned to the sequences through GO annotation, 
24 proteins were classified as metal-binding (binding 
calcium, magnesium, iron, zinc and other metal ions), 
12 as nucleotide -binding, 10 as nucleic acid-binding (1 
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specifically interacting with RNA and 4 with DNA) and 
five as adenosine triphosphate (ATP)-binding proteins. 
Additionally, four proteins were classified as glycolytic 
enzymes. In the biological processes category, 21 pro- 
teins were identified as functioning in metabolic pro- 
cesses, eight of which were associated with biosynthesis, 
while five proteins were classified as being related to 
redox mechanisms. The functions of the remainder of 
the proteins were not specified. 

Although several other trends were observed based 
on GO annotation, there were no clear biases found 
within the analysed trans-spliced protein dataset regard- 
ing cellular location, biological processes or molecular 
function. This result is in agreement with the current 
opinion that the trans-splicing mechanism is not associ- 
ated with any specific gene category or protein feature. 

In addition to the GO annotation, manual annota- 
tion of all 99 protein-coding unique sequences was per- 
formed. This was an important step, based on which addi- 
tional information about protein functions was retrieved 
from the literature. The entire set of annotated proteins 
was subdivided into 19 classes of biological processes: 
development, cell cycle regulation and apoptosis, repli- 
cation and repair, chromatin modification, transcription 
and post-transcriptional regulation, translation, protein 
folding, protein processing, modification and degrada- 
tion, signal transduction, stress responses, cytoskeleton 
organisation, carbohydrate metabolism, lipid metabo- 
lism, energy homeostasis, cofactor metabolism, amino 
acid catabolism, transport and membrane turnover and 
miscellanea (Supplementary data). 




■ Development 

■ Cell cycle regulation and apoptosis 
Replication and repair 

■ Chromatin modification 

■ Transcription and post-transcriptional regulation 
Translation 

■ Protein folding 

Protein processing, modification and degradation 

■ Signal transduction 

■ Stress response 

■ Cytoskeleton organisation 

■ Cdrtx)lrydrate metabolism 

■ Lipid metabolism 

■ Energy tiomeostasis 

■ Cofactor metabolism 

■ Amino acid catabolism 

■ Transport and membrane turnover 
Miscellanea (including unannotated proteins) 

Fig. 2: pie chart illustrating the distribution of annotated proteins in 
different classes after manual curation and classification. 



Protein length and exon composition of trans-spliced 
transcripts - To verify whether the trans-splicing mech- 
anism might be associated with genes containing small 
exons, as suggested by Davis et al. (1995), we calculated 
the length of all of the protein-coding sequences obtained 
from SchistoDB and of all 99 annotated protein-coding 
genes in our dataset. Based on this survey, we found that 
the proteins derived from trans-spliced transcripts av- 
eraged -400 amino acids in length. In comparison, the 
average length of all S. mansoni proteins obtained from 
SchistoDB was only slightly greater (-450 amino acids). 
We also estimated the number of exons per gene and 
the average exon length for all protein-coding sequenc- 
es from the parasite found in both SchistoDB and our 
dataset. Again, the analysis showed a conserved exon 
composition in the two sets of proteins, with an average 
number of six exons per protein and an average length of 
-73 amino acids per exonic region being observed. 

SL knockdown in S. mansoni sporocysts - In an at- 
tempt to disrupt the trans-splicing mechanism, we de- 
signed an siRNA targeting the S. mansoni SL sequence. 
Over a seven-day period of cultivation in the presence of 
the siRNA, we monitored cultured sporocysts for vari- 
ous phenotypes, including a decrease in the miracidial/ 
sporocyst transformation rate, mortality during the in 
vitro cultivation period and larval motility and length. 
Visual monitoring revealed that SL-siRNA treatment 
only altered the larval length phenotype, resulting in 
sporocysts with reduced size (Fig. 3A). To verify that 
this length alteration represented a significant effect, we 
measured captured images of live sporocysts and analy- 
sed the obtained data using Metamorph software. The 
average length measurements obtained for the sporo- 
cysts from the SL-siRNA treated-groups were signifi- 
cantly reduced compared to larvae from the control de- 
coy-siRNA-treated and blank groups (Fig. 3B). 

qPCR was performed to correlate the observed phe- 
notypes and gene expression patterns. Notably, the five 
target transcripts randomly chosen from known SL- 
sequence-containing genes in the enriched EST dataset 
exhibited significant reductions of at least 50% com- 
pared to the decoy-siRNA treatment control. The exam- 
ined calcium channel, ATPase inhibitor, phosphoserine- 
phosphohydrolase, thioredoxin and enolase transcripts 
displayed knockdown of 52%, 48%, 50%, 68% and 55%, 
respectively (Fig. 4). In addition, to check for nonspecif- 
ic (off-target) knockdown, non-trans-spliced transcripts 
were assessed to determine expression levels following 
SL-siRNA treatment. No significant alteration of tran- 
script levels was observed for SmZFl, SmRBx or SOD 
following SL-siRNA treatment. Thus, all of the tested 
trans-spliced transcripts analysed via RT-qPCR showed 
a similar decrease in the transcript expression level fol- 
lowing SL-siRNA treatment, suggesting a systemic 
trans- splicing knockdown effect. 

DISCUSSION 

The SL trans-splicing mechanism was first described 
as a post-transcriptional processing strategy for polycis- 
tronic transcripts in trypanosomatids (Agabian 1990). In 
subsequent years, this mechanism was observed in sev- 
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eral other organisms, but its functional role was never 
clearly defined outside the context of polycistronic tran- 
scription. One of the first hypotheses put forth to de- 
scribe this phenomenon was that trans-splicing could be 
functionally associated with specific genes or gene cate- 
gories. For example, in Ciona intestinalis, trans-splicing 
was suggested to predominantly regulate the expression 
of specific functional gene categories, such as the plas- 
ma and endomembrane system, Ca^* homeostasis and 
the actin cytoskeleton (Matsumoto et al. 2010). However, 
this hypothesis was not supported in S. mansoni as there 
was no clear evidence to show that the trans-splicing 
mechanism was linked to any particular gene category, 
biological process, molecular function, life-stage, sex, 
tissue or subcellular localisation of protein-coding tran- 
scripts (Davis et al. 1995). 

In the present study, we analysed a diverse set of 
transcripts that are subjected to trans-splicing during 
different S. mansoni life cycle stages. As it appears that 
approximately 11% of 5. mansoni transcripts are subject- 
ed to trans-splicing (Protasio et al. 2012) and given the 
complexity of the life cycle of this species, it is plausible 
that this process is important for the regulation of gene 
expression associated with parasite development and/or 
adaptation to different environments. On the other hand, 
the fraction of SL-containing transcripts reported thus 
far in S. mansoni is considerably lower compared to the 
high percentage of genes that undergo trans-splicing in 
organisms such as C. elegans andAscaris spp, which can 
be up to 70% and 90%, respectively (Allen et al. 2011). 

As previously stated, the EST dataset generated in 
this work was highly enriched in SL-containing se- 
quences, showing a 6,000-fold increase in SL-contain- 
ing sequences compared to the total number of S. man- 




Fig. 3: in vitro cultured Schistosoma mansoni larvae seven days 
post-double stranded RNA treatments. A, B: brightfield photomicro- 
graphs of in vitro cultured S. mansoni sporocysts after seven days 
of treatments with spliced leader (SL)-small interfering (siRNA) (A) 
compared to the control decoy-siRNA (B), illustrating the effects of 
the exposure to SL-siRNA on sporocyst lengths; C: graphic repre- 
sentation of sporocyst length measurements (nm) after seven days of 
siRNA treatment by scatter plots with the calculated median values 
indicated by the horizontal bars. The median values for siRNA treat- 
ments were compared to decoy-siRNA (grey plots) treatment control. 
All mesurements were statistically analysed using Mann-Whitney U 
test within each experiment. Asterisk means p < 0.0001. 



soni ESTs obtained from dbEST (60% SL-containing 
sequences were observed in our dataset vs. only 0.01% 
in dbEST). This group of SL-sequence-enriched tran- 
scripts represents a highly informative set of genes 
that could potentially shed light on several features of 
the trans-splicing mechanism. An interesting result ob- 
tained in this study comes from comparisons between 
our set of annotated trans-spliced sequences and those 
reported by Protasio et al. (2012). Approximately half of 
the protein-coding transcripts in our annotated dataset 
of 99 unique sequences were also included in the larger 
set generated by these authors. Two observations can be 
made from this comparison: (i) the strategy employed in 
the present study was appropriate for the investigation 
of SL transcripts, as demonstrated by the fact that 50 SL 
transcripts were identified in both datasets and (ii) be- 
cause our dataset contained unique protein-coding genes 
that are not found in the broader Protasio dataset, it can 
be inferred that our approach allowed the retrieval of 
different SL transcripts. The differences in the contents 
of these datasets could be explained by the applied meth- 
odologies, in that Protasio et al. (2012) used a "whole- 
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Fig. 4: histogram depicting the relative transcript levels of small inter- 
fering (siRNA)-treated sporocysts after seven days of exposure com- 
pared to the decoy-siRNA control. For each transcript tested, data is 
represented as mean fold-differences (± 2 standard error) relative to 
the decoy-small RNA control (1.00). Gray bars represent sporocyst 
messenger RNA levels showing consistent and statistically signifi- 
cant decrease of known trans-spliced transcripts [calcium channel/ 
decoy, p = 0.0056; adenosine triphosphate (ATP)ase inhibitor/de- 
coy, p = 0.0358; phosphoserine-phosphohydrolase/decoy, p = 0.0136; 
thioredoxin/decoy, p = 0.0358; enolase/decoy, p = 0.0189]. White bars 
represent relative transcript levels for non-trans-spliced transcripts in 
siRNA-treated sporocysts that showed no differences when compared 
to decoy-siRNA treated controls (SmZFl/decoy, p = 0.0755; SOD/de- 
coy, p = 0.8969; SmRBx/decoy, p = 0.0765). Transcript levels were 
determined by reverse transcriptase-quantitative polymerase chain 
reaction and data analysed using the AACt method followed by sta- 
tistical analysis using the Mann-Whitney U test. Significance levels 
were set at p < 0.05 (*). Data were generated from three independent 
experiments. The gray line represents the decoy-control level. 
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transcriptome" approach, whereas we employed a more 
selective protocol involving the capture and enrichment 
of SL transcripts prior to cloning and sequencing. 

Some of the transcripts in our S. mansoni dataset were 
previously described as SL-containing sequences, such 
as enolase and an ATPase inhibitor (Davis et al. 1995, 
Davis 1996). Although other protein-coding sequences 
described by Davis et al. (1995), such as synaptobrevin, 
a guanine nucleotide -binding protein and HMG-CoA 
reductase, were not identified within our dataset, se- 
quences related to these genes or the pathways in which 
they are involved (e.g., Golgi Snare betl, small GTPas- 
es or mevalonate pathway enzymes) were represented 
herein. Although a trans-spliced form of the glycolytic 
enzyme glyceraldehyde 3 -phosphate dehydrogenase 
(GAPDH) was not previously found in S. mansoni, this 
enzyme has been observed to undergo trans-splicing in 
Caenorhabditis spp. As reported herein, we also found 
a trans-spliced GAPDH transcript. This discrepancy be- 
tween studies may be explained by the hypothesis that 
a transcript regulated by trans-splicing is not necessar- 
ily always processed by this mechanism. In other words, 
any given transcript may or may not be subjected to 
transplicing, which could account for the suggested role 
of trans-splicing as a mechanism for gene expression 
modulation and coordination (Davis et al. 1995). 

In this work, we found S. mansoni orthologues of 
genes from different functional categories that had pre- 
viously been described to undergo trans-splicing in oth- 
er organisms. These genes encode ribosomal proteins, 
small nuclear ribonucleoproteins, members of the solute 
carrier family, GAPDH, thioredoxin, a mitochondrial 
ribosomal protein component, a WD-repeat containing 
protein, a peptidyl prolyl cis-trans isomerase, serine/ 
threonine kinases and a cAMP-dependent protein kinase 
(Davis et al. 1995, Davis 1996). These findings support 
the hypothesis that there is some conservation among the 
genes regulated by trans-splicing, indicating that some 
genes may have maintained trans-splicing as a form of 
post-transcriptional regulation throughout evolution. 

Bachvaroff and Place (2008) showed that the SL 
trans-splicing of dinoflagellate transcripts is correlated 
with their expression levels, suggesting that highly ex- 
pressed genes are more likely to be SL trans-spliced. 
This correlation was made by comparing the levels of 
SL trans-spliced transcripts with the abundance of the 
corresponding proteins, as estimated through proteomic 
analyses (Beranova-Giorgianni 2003). Accordingly, we 
observed that at least 25% of the protein-coding tran- 
scripts that we classified as undergoing trans-splicing in 
S. mansoni encode proteins identified in previous pro- 
teomic studies (Curwen et al. 2004, Knudsen et al. 2005, 
Cass et al. 2007, Wu et al. 2009, Mathieson & Wilson 
2010, Castro-Borges et al. 2011). These proteins include 
some glycolytic enzymes and many ribosomal proteins 
that we identified as TS transcripts in the present dataset. 
This observation is an indication of the importance of the 
trans-splicing mechanism among different organisms, as 
it could contribute to increasing protein abundance. 

One of the trans-spliced transcripts of particular in- 
terest identified in this study is ubiquinol-cytochrome C 



reductase complex ubiquinol binding protein (UbCRBP), 
which has been previously described as the first cistron 
of a trans-spliced polycistronic transcript, in which 
only the second gene (enolase) undergoes trans-splicing 
(Agabian 1990). The UbCRBP transcript also has been 
reported to undergo trans-splicing in Echinococcus mul- 
tilocularis (Brehm et al. 2000), reinforcing the idea that 
trans-splicing is a conserved mechanism among selected 
orthologous genes. Our findings also revealed that the 
insertion of the SL in the S. mansoni UbCRBP sequence 
occurred before the second exon of the gene, which also 
contains an upstream AG acceptor splicing signal (Fig. 
5). This result suggests that trans-splicing may generate 
alternatively spliced products, in which different exons 
could potentially receive the SL sequence. Alternative 
splice sites were previously observed in the 3-hydroxy-3 
-methyl-glutaryl CoA reductase transcript of 5. mansoni, 
in which the third exon accepts the SL sequence (Rajk- 
ovic et al. 1990). Thus, alternative trans-splicing appears 
to be a conserved mechanism in this parasite, suggesting 
a unique means of expanding the protein repertoire of 
this organism. Whether alternative trans-splicing is con- 
fined to polycistronic transcripts is unknown. 

Previous studies have suggested that short exons in 
pre-mRNAs are more prone to undergo trans-splicing 
(Davis et al. 1995). However, we did not observe any dif- 
ferences in exon size when we compared our dataset to the 
whole set of 5. mansoni genes. Additionally, the observed 
protein lengths and the number of exons per sequence 
were equivalent when our dataset was compared to the 
set of all protein-coding sequences from SchistoDB. 

Interestingly, many transcripts encoding proteins in- 
volved in the spliceosome machinery appear to undergo 
trans-splicing. This observation indicates that the trans- 
splicing mechanism may be self-regulated, which would 
represent a unique characteristic of this mechanism. In 
addition to transcripts encoding spliceosome proteins, 
the eukaryotic translation initiation factor 4e-binding 
protein and a subunit of the eukaryotic translation ini- 
tiation factor 3 transcripts were also shown to undergo 
trans-splicing. These proteins are involved in a mecha- 
nism that enables the efficient translation of trimethyl- 
guanosine-capped mRNAs in nematodes (Wallace et al. 
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Fig. 5: representative scheme of an alternative spliced leader (SL) 
trans-splicing, as was observed in the case of ubiquinol-cytochrome 
C reductase complex ubiquinol binding protein (UbCRBP). While in 
normal transcription all exons are present in the final transcript, in 
alternative SL trans-splicing, the SL insertion is between the first and 
second exons, yielding a shortened transcript with a missing exon. 
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2010) and they are of great importance for trans-splicing 
because one of the functions attributed to this mechanism 
is facilitation of the translation of transcripts containing 
this modified cap. 

To our knowledge, this is the first report describing 
an attempt to disrupt the trans-splicing mechanism in 
a metazoan using RNAi to assess its regulatory func- 
tion. The introduction of SL-siRNA to in vitro-cultured 
sporocysts resulted in a phenotype characterised by a re- 
duction of larval size. Because a large variety of SL-con- 
taining genes may have been affected by RNAi knock- 
down, it is difficult to speculate how this size reduction 
phenotype occurred. This phenotype may have resulted 
from a metabolic imbalance caused by a decrease in a 
large number of different trans-spliced transcripts. Pro- 
teins associated with crucial metabolic processes may 
have been affected by the knockdown of the trans-splic- 
ing mechanism, thereby resulting in a systemic decrease 
in metabolism, leading to possible parasite starvation 
and decreased larval length. Apart from the previous 
discussion of the occurrence of trans-splicing in gly- 
colytic transcripts, other affected processes could also 
account for the diminished size of sporocysts following 
knockdown, for example, involving proteins associated 
with the cell cycle, metabolic pathways other than gly- 
colysis and morpho-proteins, such as those described in 
our results. This phenotype may also reflect a type of 
stress caused by decreased activity of the trans-splicing 
mechanism. Taking these findings together, we can infer 
that the parasites subjected to knockdown were not in 
physiological equilibrium and that growth impairment 
is a common consequence of systemic stress and starva- 
tion, which could be caused by the reduced expression of 
transcripts under trans-splicing control in S. mansoni. 

Although lethality was not observed after seven days 
of SL knockdown, our attempt to silence the trans-splic- 
ing machinery decreased the expression of SL-containing 
transcripts by 60%. It is likely that this partial knockdown 
at the mRNA expression level may have exerted only a 
minor effect on intact larvae, not only because transcripts 
were still present in these parasites, but also because their 
encoded proteins may have persisted for an extended peri- 
od of time, depending on their turnover rate. Thus, the re- 
maining transcript levels and residual protein pools were 
most likely sufficient to maintain larval viability, even 
though the larvae appeared to be morphologically stunt- 
ed. Another possible explanation is that only a fraction of 
the molecules produced from a given transcript undergo 
trans-splicing. Interestingly, all of the tested trans-spliced 
transcripts exhibited a similar decrease at the transcript 
expression level, suggesting a systemic trans-sphcing 
knockdown effect following SL-siRNA treatment. Be- 
cause 11% of the S. mansoni transcript population appears 
to be trans-spliced, a hypothesis explaining the limited 
transcript knockdown observed could include saturation 
of the components of the RNAi machinery. 

In this study, we generated and analysed a diverse set 
of S. mansoni ESTs that were highly enriched in tran- 
scripts bearing the SL sequence. In agreement with the 
literature, the SL sequence-containing transcripts were 
not found to be associated with specific gene categories. 



subcellular localisations or life cycle stages within the 
transcript dataset we analysed. We also investigated pro- 
tein lengths, the number of exons and exon length among 
the SL-containing transcripts and found no differences 
compared to the entire set of S. mansoni transcripts. 
Disruption of the SL trans-splicing mechanism in S. 
mansoni sporocysts through RNAi resulted in a reduc- 
tion of larval size. This result provides evidence of the 
importance of this mechanism for the development of 
this organism and suggests a crucial role for the regu- 
lation of metabolic processes by SL trans-splicing. To 
determine whether the SL trans-splicing mechanism has 
a unique ancestral origin or multiple unrelated origins, 
we searched for homologous proteins in other organisms 
in which the transcripts were also trans-spliced. This 
search provided support for the hypothesis of the ori- 
gin of this mechanism in a common ancestor, although 
further analyses are needed. The association of SL tran- 
scripts with a wide range of different genes suggests that 
this mechanism plays an important regulatory role, in- 
fluencing the expression levels of different proteins as 
well as the protein repertories observed in different life 
stages and under distinct environmental conditions. To 
our knowledge, this is the most comprehensive survey 
of SL transcripts conducted in schistosomes to date and 
our results provide a valuable resource for further stud- 
ies addressing the mechanisms of SL trans-splicing at 
both the biological and molecular levels. 
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Primers used in reverse transcriptase-quantitative polymerase chain reaction experiments for testing the levels of trans-spliced transcripts after RNA interference treatment 



Protein 



Protein ID 



Forward primer spliced leader 



Reverse primer 



Trans-spliced transcripts 
(positive trans-splicing control) 

High voltage-activated calcium channel-Cav B2 

Phosphoserine phosphohydrolase-like 

ATPase inhibitor-like 

Thioredoxin precursor 

Enolase 

Constitutive transcripts 
(endogenous normalisation control) 

a-tubulin 

Non-transpliced transcripts 
(negative trans-splicing control) 

SmZFl 

Superoxide dismutase 1 
SmRing box 



AY277532.1 
SMU30262 
U30266.1 

AY948980.1 
SMU30175 



XM 002579987.1 



AF316827.1 
M27529.1 
DQ466078.1 



GTCACGGTTTTACTCTTGT 
GTCACGGTTTTACTCTTGT 
GTCACGGTTTTACTCTTGT 
GTCACGGTTTTACTCTTGT 
GTCACGGTTTTACTCTTGT 



CAAATGGGAAATGCTTGTTG 



ACTTCTCTCAGAAATCCAGCCT 

GATCCGGCTATTGCTTCATT 
GGCATCCCTCAGTGAGAATAA 



GGCAAAATCGCTACCTGAAT 
GTCCAAACAGACGCATTTGA 
GAACCACCAGCTTCACGAAT 
AAGCAGACGTATTGAAGGGAAA 
TCCACCTCAACTGTGGGATT 



TGAACGAGTCATCACCACCT 



TGGAGAGGATTATACAATCTGGTT 
CTGCCACGCTTCCATTAAC 
TCCGGCAGATAGCACACAATTA 
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2 



Class 

Development 



Cell cycle regulation 
and apoptosis 



Replication and repair 



Chromatin modification 



Transcription and 

post-transcriptional 

regulation 



Translation 



Protein folding 

Protein processing, 
modification and degradation 



Protein name (SchistoDB ID) 

bola-like protein my016 (012050) 
cactin (038640) 
chibby protein pkdl interactor (035010) 

chll helicase (130210) 
enhancer of rudimentary protein (175210) 
jun activation domain binding protein (190100) 
programmed cell death protein (067490) 
pten-related phosphatase (012400) 
regulator of chromosome condensation-related (074990) 

cript-related (132860) 
endonuclease III (006710) 
pol-related (186960.2) 
histone hl/h5 (003770) 
histone deacetylase hda2 (138770) 
mrg-binding protein (130630) 

DNA-directed rna polymerase I (072800) 
Isml, putative (175550) 
nucleotide binding protein 2 (154750) 
pre-mRNA splicing factor (080110) 
ribonucleic acid binding protein 81(070200) 
small nuclear ribonucleoprotein Ula, U2b (069880) 
serine/arginine rich splicing factor (113620.4) 
eukaryotic translation initiation factor 4e-binding protein (074390) 
eukaryotic translation initiation factor 3 subunit (158500) 
mitochondrial ribosomal protein L24 (035410) 
mitochondrial ribosomal protein S33 (039980) 
translation machinery-associated protein (169920) 
peptidyl-prolyl cis-trans isomerase-hke 3 (080150) 
rotamase (160790) 
26S proteasome non-atpase regulatory subunit (181380) 
ammd-like (peptidase) (074940) 
aspartic proteinase (013040.1) 
beta-l,4-galactosyltransferase (153110) 

glycosyltransferase (046880.2) 
methionine aminopeptidase (011120) 
o-sialoglycoprotein endopeptidase (012450) 
protein-l-isoaspartate o-methyltransferase (058140.2) 
peptidase (165910) 
ring finger (028430) 



Protein function 

Morphogenesis, cell growth 
Morphogenesis, cell growth and survival 
Morphogenesis, cell growth 
Chromosome segregation 
Pyrimidine synthesis and cell cycle 
Activation of mitotic checkpoint 
Cell regulation and apoptosis 
Cell regulation and apoptosis 
Chromosome condensation 
DNA double strand break repair 
DNA repair of damaged pyrimidine 
DNA replication 

Nucleosome formation 
Histone deacetylation 
Histone acetylation 

rRNA synthesis 
RNA splicing (snRNP formation) 
RNA processing, traffic and stability (snRNP formation) 
RNA splicing 
RNA traffic and surveillance 
RNA splicing (spliceosome protein) 
RNA splicing (spliceosome protein) 
Inhibition of protein translation 

mRNA-ribosome interaction 
Mitochondrial protein synthesis 
Mitochondrial protein synthesis 

Translation 
Acceleration of protein folding 
Acceleration of protein folding 
Protein degradation via proteasome 
Protein degradation 
Protein degradation 
Glycosyltransferase activity 
Glycosyltransferase activity 
Removal of initial methionine 
Sialoglycoprotein degradation 
Proteins repair and degradation 
Protein degradation 
Protein ubiquitination 
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Class 



Protein name (SchistoDB ID) 



Protein function 



Signal transduction 



Stress response 
Cytoskeleton organisation 



Carbohydrate metabolism 



Lipid metabolism 



Energy homeostasis 



hybrid protein kinase, other group, WNK family (154440) 
protein phosphatase 2C (133060.3) 
ral guanine nucleotide dissociation stimulator (154920) 
rab 15, 13, 10, 1, 35, 5 (072290) 
rho GDP-dissociation inhibitor-related (045610) 
serine/threonine kinase, CAMK group, CAMKL family, AMPK subfamily (142990) 
serine/threonine kinase, CMGC group, RCK family, MAK subfamily (132890) 
tbcl domain family member (152820) 
glutaredoxin (006550.2) 
hsp70-interacting protein (064860) 
thioredoxin mitochondrial type (037530) 
beta-parvin-related (133200) 
cofilin, actophorin (120700.2) 
dynactin subunit 3 (dynactin light chain p24) (136990) 
nuclear movement protein nude (103320) 
tektin (059600) 
tropomodulin (003710) 
tropomyosin, putative (031770.4) 
enolase (024110) 
glyceraldehyde-3-phosphate dehydrogenase (056970.1) 
1-lactate dehydrogenase (038950) 
ribulose-5-phosphate-3-epimerase (104580.2) 
l-acylglycerol-3 -phosphate o-acyltransferase (000070) 

acetyl-CoA C-acyltransferase (129330) 
acyl-coenzyme A binding domain containing (147930) 
arvl (021280) 
serine palmitoyltransferase I (028080) 
3-dehydroecdysone 3alpha-reductase-related (042680) 
isopentenyl-diphosphate delta isomerase (130430) 
sterol reductase-related (124300) 
atpase inhibitor (023010) 
ATP synthase delta chain (082120) 
cytochrome C oxidase copper chaperone (029150) 
cytochrome C subunit I 
ectonucleotide pyrophosphatase/phosphodiesterase (104270) 
hexaprenyldihydroxybenzoate methyltransferase (029060) 
ubiquinone biosynthesis protein (106190) 
ubiquinol-cytochrome C reductase complex ubiquinol binding protein (024110) 



Chloride ion traffic 
Serine/threonine phosphatase 
GTPase activation 
Small GTPase 
Interaction with Rho GTPases 
Signal transduction pathways for cellular energy homeostasis 
Signal transduction pathways for cell cycle regulation 
Small GTPase activation 
Antioxidant defence 
Protection against physiologic stress 
Antioxidant defence 
Regulation of cell adhesion and cytoskeleton organisation 
Actin binding 
Microtubule and dynein binding 
Dynein binding 
Structural component of microtubules 
Actin growth regulation 
Actin mechanics regulation 
Glycolysis 
Glycolysis 
Glycolysis 
Pentose phosphate pathway 
Lipid metabolism 
Lipid catabolism 
Lipid metabolism 
Sterol homeostase 
Sphingolipids biosynthesis 
Ecdysteroids inactivation 
Biosynthesis of isoprenoids 

Biosynthesis of steroids 
Inhibition of ATP synthesis 
Catalysis of ATP synthesis 
Involved in ATP synthesis 
Involved in ATP synthesis 
ATP hydrolysis 
Ubiquinone and ATP synthesis 
Ubiquinone and ATP synthesis 
Involved in ATP synthesis 
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Class 

Cofactor metabolism 

Amino acid catabolism 

Transport and 
membrane turnover 



Miscellanea 

(including unannotated proteins) 



Protein name (SchistoDB ID) 

molybdopterin-binding (126650) 
uroporphyrinogen-III synthase (079840) 
4-hydroxyphenylpyruvate dioxygenase (007960) 
valacyclovir hydrolase (193750) 
fatty acid binding protein (174440.4) 
mitochondrial carrier protein-related (058130) 
nuclear transport factor (037700) 
peripheral-type benzodiazepine receptor, putative (102510) 
solute carrier family 35 member CI, putative (155830) 
B-cell receptor-associated protein-like protein (175660) 
dynamin-associated protein (135790) 
golgi snare betl-related (130940) 
rabb and c (169460) 
small VCP/p97-interacting protein (194560) 
vacuolar ATP synthase proteolipid subunit 1, 2, 3 (004310.1) 
amyloid beta A4 protein related (144990) 
autophagy protein 16-like (135430) 
metal dependent hydrolase-related (024490) 
protein C14orf 153 precursor (147820) 
Sj-Tsl (123270) 
tetraspanin D76 (041460) 
WD repeat protein SLl-17 (049520) 



Protein function 

Molybdopterin cofactor synthesis 
Haeme synthesis 
Tyrosine catabolism 
Alpha-amino acid catabolism 
Fatty acids transport 
Transport across mitochondrial membrane 
Proteins transport 
Transport of cholesterol, porphyrin and anions 
Transport of GDP-fucose 
Vesicular transport of proteins between the ER and the Golgi 
Vesicular transport 
Vesicular transport between the ER and the Golgi 
Vesicular transport of proteins between the membrane and the Golgi 
ER integrity maintenance 
Transport across membranes and regulation of internal pH 
Involved in several physiological processes 
Involved in cellular autophagy 
Presents hydrolase activity 
Unannotated protein 
Unannotated transmembrane protein 
Involved in cell adhesion 
Involved in several physiological processes 



transcripts were classified according to their biological function after manual annotation. ATP: adenosine triphosphate; ER: endoplasmic reticulum; GDP: guanidine 5'-diphosphate- 
beta-1; mRNA: messenger RNA; rRNA: ribosomal RNA; snRNP: small nuclear ribonucleoprotein. 



Detailed annotation of all proteins derived from trans-spliced transcripts 

classified in this study 



Protein Name 
bola-like protein my016 



GO annotation 

0004129 c)^ochrome-c oxidase activity 



Life-cycle Phase 

Adult worms of mixed 
gender, Schistosomula, 
Female worm, Male 
worm 



Localization 

Cytoplasm (Psort) 



Description 

The morpho-protein BolA from Escherichia coli is related to round morphology and may be 
involved in switching the cell between elongation and septation systems during cell division. The 
expression of BoLA is growth rate regulated and is induced during the transition into the the 
stationary phase. BolA is also induced by stress during early stages of growth and may have a 
general role in stress response (InterPro IPR002634). 



Protein Name 

cactin-related 



Life-cycle Phase 

Male worms 



Localization 

Nucleus (Psort and 
GO) 



GO annotation 

0000398 nuclear mRNA splicing, via spliceosome 

0005634 nucleus 

0005681 spliceosomal complex 

0006397 mRNA processing 

0008380 RNA splicing 

007 1013 catalj^ic step 2 spliceosome 



Description 

Cactin specifically interacts with the cactus protein, having a possible function in the Rel pathway 
controlling the formation of dorsal-ventral embryonic polarity. In mammals, Rel transcription 
factors control the immune and inflammatory responses, cell growth and survival, differentiation 
of vertebrate limbs and muscle development. In Drosophila it is involved with the establishment of 
dorsal-ventral polarity in the early embryo. Cactin seems to act through the degradation of cactus, 
liberating the dorsal protein to translocate to the nucleus and regulating transcription of target 
genes. A cactin homologous in Caenorhabditis elegans is required for correct migration of distal 
tip cells and for gonad morphogenesis. 



Protein Name 

chibby protein pkd2 interactor, putative 



Life-cycle Phase 

Adult worms of mixed 
gendes, Female worms, 
Male worms 



Localization 

Mitochondria (Psort) 



GO annotation 

null 



Description 

Chibby is a nuclear protein that directly interacts with beta-catenin and seems to inhibit beta- 
catenin-mediated transcriptional activation. Beta-catenin is a subunit of the cadherin complex and 
has been implicated as a component of the Wnt signaling pathway. 



Protein Name 

chll helicase homo log, putative 



Life-cycle Phase 

Schistosomula 



Localization 

Nucleus (Psort and 
GO) 



GO annotation 

0000166 nucleotide binding 

0003676 nucleic acid binding 

0003677 DNA binding 

0004003 ATP-dependent DNA helicase activity 
0004386 helicase activity 
0005524 ATP binding 
0005634 nucleus 

0006 139 nucleobase-containing compound metabolic process 
0008026 ATP-dependent helicase activity 
0016787 hydrolase activity 

0016817 hydrolase activity, acting on acid anhydrides 

0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides 
Description 

Chll helicases are nuclear proteins that present an ATP binding motif, which is essencial for its 
function in chromossome segregation. ATPases/DNA helicases may alter DNA topology to allow 
loading of proteins involved in maintaining sister chromatid cohesion. Saccharomyces cerevisiae 
chll mutants have a significant increase in the rate of chromosome missegregation. 



Protein Name 

enhancer of rudimentary protein, putative 



Life-cycle Phase 

All 



Localization 

Cytoplasm (Psort) 



GO annotation 

0007049 cell cycle 



Description 

The Drosophila protein 'enhancer of rudimentary' is a small protein whose function is not yet clear. 
From an evolutionary point of view, it is highly conserved and has been found to exist in probably 
all multicellular eukaryotic organisms. It has been proposed that this protein has functions 
implicated in pyrimidine biosynthesis and the cell cycle (InterPro IPR000781). 



Protein Name 

jun activation domain binding protein, putative 



Life-cycle Phase 

Male worms 



Localization 

Extracellular (Psort) 



GO annotation 
null 

Description 

Jun activation domain binding protein (Jab) 1 is required for the hj^erphosphorylation of 53BP1 
upon mitotic stress conditions and is involved in proper activation of the mitotic checkpoint 
mechanism. Jun proteins are a component of the activator protein 1 (API) complex, which 
regulates gene expression in response to a variety of stimuli, including c5^okines growth factors, 
stress, and bacterial and viral infections. API in turn controls a number of cellular processes 
including differentiation, proliferation and apoptosis. 



Protein Name 

programmed cell death protein, putative 



Life-cycle Phase 

Adult worms of mixed 
gender. Egg 



Localization 

Mitochondria (Psort) 



GO annotation 

0005509 calcium ion binding 



Description 

Involved in the programmed cell death (apoptosis) pathways. 



Protein Name 

pten-related phosphatase (plip) 



Life-cycle Phase 

Adult worms of mixed 



Localization 

C)^oplasm, 



EC: 3.1.3.16 3.1.3.48 



gender, Female worms Extracellular (Psort) 



GO annotation 

0004725 protein tyrosine phosphatase activity 

0006470 protein dephosphorylation 

0008138 protein tyrosine/serine/threonine phosphatase activity 

00 1 63 1 1 dephosphorylation 

0016787 hydrolase activity 

0016791 phosphatase activity 

0035335 peptidyl-tyrosine dephosphorylation 



Description 

PTEN (Phosphatase and tensin homolog) is related to the cell cycle regulation and apoptosis. The 
loss of just one copy of this gene has been shown to trigger uncontrolled cell growth. The protein 
encoded by this gene is a phosphatidylinositol-3,4,5-trisphosphate 3 -phosphatase. It contains a 
tensin-like domain as well as a catalytic domain similar to that of the dual specificity protein 
tyrosine phosphatases. Unlike most of the protein tyrosine phosphatases, this protein preferentially 
dephosphorylates phosphoinositide substrates. It negatively regulates intracellular levels of 
phosphatidylinositol-3,4,5-trisphosphate in cells and functions as a tumor suppressor by negatively 
regulating the Akt/PKB signaling pathway. 



Protein Name 

regulator of chromosome condensation-related 

GO annotation 

null 



Life-cycle Phase 

Female worms 



Localization 

Nucleus (Psort) 



Description 

This protein is involved in chromossome condensation, which is a crucial step in the preparation of 
cells for division. 



Protein Name 

cript-related 



Life-cycle Phase 

Female worms, male 
worms 



Localization 

Nucleus (GO) 



GO annotation 

0005634 nucleus 

0006355 regulation of transcription, DNA-dependent 
0043 1 89 H4/H2A histone acetyltransferase complex 

Description 

This protein contains a 5'-t)T0syl DNA phosphodiesterase domain (TDP2). In human, TDP2 is 
required for the efficient repair of topoisomerase Il-induced DNA double strand breaks. The 
topoisomerase is covalently linked by a phosphotyrosyl bond to the 5'-terminus of the break. TDP2 
cleaves the DNA 5'-phosphodiester bond and restores 5 '-phosphate termini, needed for subsequent 
DNA ligation, and hence repair of the break. This family belongs to the large 
exonuclease/endonuclease/phosphatase superfamily that contains functionally diverse enzymes 
that share a common catalytic mechanism of cleaving phosphodiester bonds. 



Protein Name 

endonuclease III, putative 
EC: 3.2.2 



Life-cycle Phase 

Schistosomula 



Localization 

Nucleus, Cytoplasm 
(Psort) 



GO annotation 

0003677 DNA binding 

0003824 catalytic activity 
0004519 endonuclease activity 
0006281 DNA repair 



0006284 base-excision repair 






0051539 4 iron, 4 sulfur cluster binding 






Description 






Endonuclease III plays a role in the release of damaged pjoimidines from double-stranded DNA 


and the subsequent cleavage of the 3' from the apurinic site. 




Protein Name 


Life-cycle Phase 


Localization 


pol-related 


Adult worms of mixed 


Nucleus, Cytoplasm 




gender 


(Psort) 


GO annotation 






0006259 DNA metabolic process 






0008 1 50 biological process 






0008152 metabolic process 






0009058 biosynthetic process 






Description 






Pol, is a reverse transcriptase, a DNA polymerase enzyme that transcribes single-stranded RNA 


into single-stranded DNA. It is also a DNA-dependent DNA polymerase which synthesizes the 


second strand of DNA of the double helix DNA once the RNA has been reverse transcribed. 


Protein Name 


Life-cycle Phase 


Localization 


histone hl/h5, putative 


Egg 


Nucleus (Psort and 






GO) 


GO annotation 






0000786 nucleosome 






0003677 DNA binding 






0005634 nucleus 






0005694 chromosome 






0006334 nucleosome assembly 






Description 






Histone proteins have central roles in both chromatin organization (as 


structural units of the 


nucleosome) and gene regulation (as dynamic components that have a 


direct impact on DNA 


transcription and replication). The core histone octamer is formed by a central H3-H4 tetramer and 


two flanking H2A-H2B dimers. In addition, there is a linker histone, called HI (or H5 in avian 


species) which is crucial to the higher-order organization of the chromatin. This enzyme, present in 


all multicellular organisms is the most divergent one, with cell-type and stage- specific variation 


(InterProIPR005818). 






Protein Name 


Life-cycle Phase 


Localization 


histone deacetylase hda2, putative 


Male worms 


Nucleus (Psort) 


EC: 3.5.1.98 






GO annotation 






0006464 protein modification process 






0006810 transport 






0006950 response to stress 






0006996 organelle organization 






0007010 cytoskeleton organization 






0007154 cell communication 






0007165 signal transduction 






0008 1 50 biological process 






0008152 metabolic process 






0008219 cell death 







0009056 catabolic process 

0009058 biosynthetic process 

0009605 response to external stimulus 

0015031 protein transport 

0016043 cellular component organization 

00 1 953 8 protein metabolic process 

0044238 primary metabolic process 

0050789 regulation of biological process 

Description 

HDA2 and HDA3 associate with HDAl to generate the activity of the HDAl histone deacetylase 
complex. HDAl interacts with itself and with the HDA2-HDA3 subcomplex to form a probable 
tetramer and these interactions are necessary for catalytic activity. The HDAl histone deacetylase 
complex is responsible for the deacetylation of lysine residues on the N-terminal part of the core 
histones (H2A, H2B, H3 and H4). Histone deacetylation gives a tag for epigenetic repression and 
plays an important role in transcriptional regulation, cell cycle progression and developmental 
events (InterPro IPR021006). 



Protein Name 

mrg-binding protein, putative 



Life-cycle Phase 

Female worms, Male 
worms 



Localization 

Nucleus (Psort) 



GO annotation 

0006139 nucleobase-containingcompoundmetabolicprocess 

0006351 transcription, DNA-dependent 

0008150 biological process 

0008152 metabolic process 

0009058 biosynthetic process 

0044238 primary metabolic process 

0050789 regulation of biological process 

0006139 nucleobase-containing compound metabolic process 

Description 

Component of a histone acetyltransferase complex which is involved in transcriptional activation 
of specific genes by acetylation of nucleosomal histones H4 and H2A. This modification may both 
alter nucleosome-DNA interactions and promote interaction of the modified histones with other 
proteins which positively regulate transcription. This complex may be required for the activation 
of transcriptional programs associated with oncogene and proto-oncogene mediated growth 
induction, tumor suppressor mediated growth arrest and replicative senescence, apoptosis, and 
DNA repair. The complex may also play a direct role in DNA repair when recruited to sites of 
DNA damage (UniProt Q9NV56). 



Protein Name 

DNA-directed ma poljmierase I, putative 
GO annotation 

0006351 transcription, DNA-dependent 



Life-cycle Phase 

Egg, Male worms 



Localization 

Nucleus / Extracellular 
(Psort) 



Description 

RNA polymerase I, one of three nuclear DNA-directed RNA polymerases found in all eukaryotes, 
is a multisubunit complex which typically produces rRNAs. Two large subunits comprise the most 
conserved portion including the catalytic site and share similarity with other eukaryotic and 
bacterial multisubunit RNA polymerases. The remainder of the complex is composed of smaller 
subunits (generally ten or more), some of which are also found in RNA polymerase III and others 
of which are also found in RNA polymerases II and III. Although the core is competent to mediate 



ribonucleic acid synthesis, it requires additional factors to select the appropriate template. 



Protein Name 
Isml, putative 



GO annotation 

null 



Life-cycle Phase 
Adult worms of mixed 

gender, Egg, 
Schistosomula, Female 
worms, Male worms 



Localization 

Mitochondria (Psort) 



Description 

U6 snRNA-associated Sm-like protein Lsml is part of the Sm-like proteins class, which were 
identified in a variety of organisms based on the presence of the Sm sequence motif. The Sm-like 
proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important 
for pre-mRNA splicing. 



Protein Name 

nucleotide binding protein 2 (nbp 2), putative 
GO annotation 

0000166 nucleotide binding 
0003676 nucleic acid binding 
0005524 ATP binding 



Life-cycle Phase 

Female worms 



Localization 

Nucleus (Psort) 



Description 

This protein contains two domains, the RRM and the MRP domains. RRM (RNA recognition 
motif) is a highly abundant eukaryotic domain, found in proteins involved in post-transcriptional 
gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. 
The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), 
proteins implicated in regulation of alternative splicing, and protein components of small nuclear 
ribonucleoproteins (snRNPs). MRP (Multiple Resistance and pH adaptation) is a homologue of the 
Fer4_NifH superfamily. Like the other members of the superfamily, MRP contains an ATP-binding 
domain at the N-termini. It is found in bacteria as a membrane-spanning protein and fiinctions as a 
Na+/H+ antiporter. 



Protein Name 

pre-mRNA splicing factor, putative 



Life-cycle Phase 

Egg, Schistosomula, 
Female worms 



Localization 

Cytoplasm (Psort) 



GO annotation 

null 



Description 

Pre-mRNA splicing is a predominantly co-transcriptional event which involves a large number of 
essential splicing factors. 



Protein Name 

ribonucleic acid binding protein SI, putative 



Life-cycle Phase 

Female worms 



Localization 

Cj^oplasm, Nucleus 
(Psort) 



GO annotation 

0006 139 nucleobase-containing compound metabolic process 
0008150 biological process 
0008152 metabolic process 

Description 

This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both 
mRNA nuclear export and surveillance. mRNA surveillance detects exported mRNAs with 



truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When 
translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs 
containing premature stop codons. This serie-rich protein binds to the mRNA and remains bound 
after nuclear export, acting as a nucleo-cytoplasmic shuttling protein. 



Protein Name 

small nuclear ribonucleoprotein Ula, U2b, 
putative 



Life-cycle Phase 

Adult worms of mixed 
gender, Schistosomula, 
Female worms 



Localization 

Nucleus (Psort) 



GO annotation 

0000166 nucleotide binding 
0003676 nucleic acid binding 
0030529 ribonucleoprotein complex 



Description 

Small nuclear ribonucleoproteins are RNA-protein complexes that combine with unmodified pre- 
mRNA and various other proteins to form a spliceosome. 



Protein Name 

serine/arginine rich splicing factor, putative 



Life-cycle Phase 

Adult worms of mixed 
gender. Egg, 
Schistosomula 



Localization 

Mitochondria (Psort) 



GO annotation 

0000166 nucleotide binding 
0003676 nucleic acid binding 



Description 

Serine/arginine-rich splicing factors play an important role in constitutive and alternative splicing 
as well as during several steps of RNA metabolism. 



Protein Name 

eukaryotic translation initiation factor 4e- 
binding protein, putative 

GO annotation 

0006412 translation 

0008 1 50 biological process 

0008152 metabolic process 

0009058 biosjoithetic process 

0019538 protein metabolic process 

0044238 primary metabolic process 

0050789 regulation of biological process 



Life-cycle Phase 

Egg 



Localization 

Cytoplasm, Nucleus 
(Psort) 



Description 

This protein interacts with the eukaryotic translation initiation factor 4e, which binds to the cap 
structure of mRNAs during an early step of protein synthesis initiation and facilitates ribosome 
binding by inducing the unwinding of mRNA secondary structures. The binding protein prevents 
the translation factor to assemble its complex and therefore inhibits protein synthesis (UniProt 
Q13541). 



Protein Name 

eukaryotic translation initiation factor 3 subunit, 
putative 

GO annotation 

null 



Life-cycle Phase 

Female worms, Male 
worms 



Localization 

Plasma membrane 
(Psort) 



Description 

Eukaryotic initiation factor 3 is a multi-subunit complex that is required for binding of mRNA to 
40S ribosomal subunits, stabilization of ternary complex binding to 40S subunits, and dissociation 
of 40S and 60S subunits (InterPro IPR007783). 



Protein Name 

mitochondrial ribosomal protein L24, putative 

GO annotation 

0006412 translation 



Life-cycle Phase 

Adult worms of mixed 
gender, Female worms 



Localization 

Mitochondria, 
C)^oplasm (Psort) 



Description 

Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein 
synthesis within the mitochondria. Among different species, the proteins comprising the 
mitoribosome differ greatly in sequence, and sometimes in biochemical properties. Mitochondrial 
ribosomal protein L24 sequence analysis identified two transcript variants that encode the same 
protein in humans. 



Protein Name 

mitochondrial ribosomal protein S3 3, putative 

GO annotation 

0005840 ribosome 



Life-cycle Phase 

Egg, Male worms 



Localization 

Nucleus (Psort) 



Description 

Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein 
synthesis within the mitochondria. Among different species, the proteins comprising the 
mitoribosome differ greatly in sequence, and sometimes in biochemical properties. 



Protein Name 

Translation machinery-associated protein, 
putative 



Life-cycle Phase 

Adult worms of mixed 
gender 



Localization 

Nucleus (Psort) 



GO annotation 

null 



Description 

Involved in protein translation. 



Protein Name 

peptidyl-prolyl cis-trans isomerase-like 3, 
putative (ppiase) (rotamase) (cyclophilin-like 
protein ppil3) (c5^101) 
EC 5.2.1.8 



Life-cycle Phase 

Egg, Schistosomula, 
Female worms, Male 
worms 



Localization 

C)^oplasm (Psort) 



GO annotation 

0006139 
0006464 

0008150 
0008152 
0019538 



n 

nucleobase-containing compound metabolic process 
protein modification process 

biological process 
metabolic process 
protein metabolic process 



Description 

This enzyme acts accelerating protein folding by catalysing the cis-trans isomerisation of proline 
imidic peptide bonds in oligopeptides. It also has chaperone-like functions and is the major high- 
affinity binding proteins for the immunosuppressive drug cyclosporin A (CSA) in vertebrates. 
Cyclophilins are found in all prokaryotes and eukaryotes, and have been structurally conserved 
throughout evolution, implying their importance in cellular function. They share a common 



domain with isomerase and additional domains unique to each member of the family, which are 
important for selection of protein substrates and subcellular compartmentalization (InterPro 
IPR002130). 



Protein Name 

rotamase, putative 
EC: 5.2.1.8 

GO annotation 

0006464 protein modification process 
0008 1 50 biological process 
0008152 metabolic process 
00 1 953 8 protein metabolic process 



Life-cycle Phase 

Adult worms of mixed 
gender, Egg, Female 
worms 



Localization 

Membrane (Psort) 



Description 

This protein is involved in the interconversion of cis and trans isomers of peptide bonds involving 
proline residues. 



Protein Name 

26S proteasome non-atpase regulatory subunit, 
putative 



Life-cycle Phase 

Adult worms of mixed 
gender. Female worms. 
Egg 



Localization 

Nucleus (Psort) 



GO annotation 

0000502 proteasome complex 



Description 

In eukaryotic cells, most proteins in the cytosol and nucleus are degraded via the ubiquitin- 
proteasome pathway. The 26S proteasome is a molecular machine built from approximately 31 
different subunits, which catalyzes protein degradation. 



Life-cycle Phase 

Female worms 



Localization 

Cytoplasm, 
Cj^oskeleton (Pfam) 



Protein Name 

Ammd-like protein 
EC: 3.4.14.4 

GO annotation 

0008152 metabolic process 
0016787 hydrolase activity 

Description 

Annotated as "family S9 non-peptidase homologue", it is homologous to Clonorchis sinensis 
dipeptidyl-peptidase III. The enzyme is classified as a hydrolase acting on peptide bonds 
(peptidase), involved in the release of an N-terminal dipeptide from a peptide comprising four or 
more residues. Human dipeptidyl-peptidase III is a member of the S9B family in clan SC of the 
serine proteases. This cytoplasmic protein binds a single zinc ion and has post-proline dipeptidyl 
aminopeptidase activity. Both Schistosoma mansoni and C. sinensis enzymes have a conserved a 
esterase/lipase domain, the presence of which could be explained from the close relationship 
between lipases and peptidases. 



Protein Name 

aspartic proteinase, putative 



Life-cycle Phase 

Schistosomula 



Localization 

Extracellular / ER 
(Psort) 



GO annotation 

0004190 aspartic-type endopeptidase activity 
0006508 proteolysis 
0008233 peptidase activity 



0016787 hydrolase activity 
Description 

Many parasites which cause serious human or livestock diseases produce aspartic proteinases 
which perform critical functions for parasite viability. A Schistosoma japonicum aspartic 
proteinase was characterized and shown to be closely related to the cathepsin D, a lysosomal 
aspartic protease of mammals. The main function of cathepsin D is the degradation of endocytosed 
proteins. Aspartic proteinase activity is present in extracts of adult 5*. japonicum and S. mansoni 
and in culture media in which schistosomes were maintained and was capable of digesting 
hemoglobin. The schistosome aspartic proteinase may play a pivotal role in the catabolism of 
hemoglobin obtained from host erythrocytes. 



Protein Name 

beta- 1 ,4-galactosyltransferase, putative 
EC: 2.4.1 

GO annotation 

0005975 carbohydrate metabolic process 
0016740 transferase activity 
0016757 



Life-cycle Phase 

Female worms 



Localization 

Golgi /Cytoplasmic 
(Psort) 



transierase activity 

transferase activity, transferring glycosyl groups 



Description 

Beta-l,4-galactosyhansferase is a trans-Golgi glycosyltransferase that participates in the synthesis 
of Gal betal-4-GlcNAc-disaccharide (type 2 disaccharide determinants) unit of glycoconjugates. 
Glycosyltransferases acts in the transfer of a monosaccharide unit from an activated nucleotide 
sugar to another molecule. The result of a glycosyl transfer can be a carbohydrate, a glycoside, an 
oligo- or a polysaccharide. Some glycosyltransferases catalyse transfer to Pi or water. Glycosyl 
transfer can also occur to protein residues or lipids (forming glycolipids). The glycosylation of 
proteins can help during its folding and increase structural stability. Glycosylation may also play a 
role in cell-cell adhesion. Conformational changes in beta4Gal-Tl also creates the binding site for 
a mammary gland-specific protein, alpha-lactalbumin, which changes the acceptor specificity of 
the enzyme toward glucose to sjmthesize lactose during lactation in mammals. 



Life-cycle Phase 

Schistosomula 



Localization 

Membrane (Psort) 



Protein Name 

glycosyltransferase, putative 
EC: 2.4.1 

GO annotation 

0006464 protein modification process 
0006629 lipidmetabolic process 
0008 1 50 biological process 
0008152 metabolic process 
0009058 biosynthetic process 

Description 

Glycosyltransferases acts in the transfer of a monosaccharide unit from an activated nucleotide 
sugar to another molecule. The result of a glycosyl transfer can be a carbohydrate, a glycoside, an 
oligo- or a polysaccharide. Some glycosyltransferases catalyse transfer to Pi or water. Glycosyl 
transfer can also occur to protein residues or lipids (forming glycolipids). The glycosylation of 
proteins can help during its folding and increase structural stability. Glycosylation may also play a 
role in cell-cell adhesion. 



Protein Name 

methionine aminopeptidase, putative 
EC: 3.4.11.18 



Life-cycle Phase 

Egg 



Localization 

C)^oplasm (Psort) 



GO annotation 

0004177 aminopeptidase activity 

0006508 proteolysis 

0008233 peptidase activity 

0008235 metalloexopeptidase activity 

0009987 cellular process 

0016787 hydrolase activity 

0046872 metal ion binding 

Description 

This enzyme is responsible for the removal of the amino-terminal (initiator) methionine from 
nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is 
small and uncharged (InterPro IPR001714). 



Protein Name 

o-sialoglycoprotein endopeptidase, putative 
EC: 3.4.24.57 



Life-cycle Phase 

Male worms 



Localization 

Extracellular (Psort) 



GO annotation 

0004222 metalloendopeptidase activity 
0006508 proteolysis 



Description 

This enzyme catalyzes the hydrolysis of 0-sialoglycoproteins. A sialoglycoprotein is 
combination of sialic acid and glycoprotein, which is, itself, a combination of sugar and protein. 



Protein Name 

protein-l-isoaspartate o-methyltransferase, 

putative 

EC: 2.1.1.77 



Life-cycle Phase 

Adult worms of mixed 
gender 



Localization 

Cytoplasm (Psort) 



GO annotation 

0006464 protein modification process 
0008 1 50 biological process 
0008 1 52 metabolic process 



Description 

Is an enzyme that catalyses the transfer of a methyl group from S-adenosylmethionine to the free 
carboxyl groups of D-aspartyl or L-isoaspartyl residues in a variety of peptides and proteins. The 
enzyme does not act on normal L-aspartyl residues L-isoaspartyl and D-aspartyl are the products 
of the spontaneous deamidation and/or isomerisation of normal L-aspartyl and L-asparaginyl 
residues in proteins. This enzyme plays a role in the repair and/or degradation of these damaged 
proteins; the enzymatic methyl esterification of the abnormal residues can lead to their conversion 
to normal L-aspartyl residues (InterPro 1PR000682). 



Protein Name 

peptidase, putative 



Life-cycle Phase 

Adult worms of mixed 
gender. Female worms 



Localization 

Nucleus (Psort) 



GO annotation 

null 



Description 

This enzjmie is involved in protein degradation pathways, acting in the cleavage of peptide bonds. 



Protein Name 
ring finger, putative 

GO annotation 



Life-cycle Phase 

Male worms 



Localization 

Extracellular (Psort) 



0008270 

0046872 



zinc ion binding 

metal ion binding 



Description 

RING (Really Interesting New Gene) finger domain is a protein structural domain of zinc finger 
type which contains a Cys^HisCys^ amino acid motif which binds two zinc cations. Many proteins 

containing a RING finger play a key role in the ubiquitination pathway. 



Protein Name 

Hybrid Protein Kinase_Other Group_WNK 

Family 

EC: 2.7.1.37 



Life-cycle Pliase 

Schistosomula 



Localization 

Cytoplasm, Nucleus 
(Psort) 



GO annotation 

0004672 protein kinase activity 

0004674 protein serine/threonine kinase activity 

0005524 ATP binding 

0006468 protein phosphorylation 

0016301 kinase activity 

0016310 phosphorylation 

0016772 transferase activity, transferring phosphorus-containing groups 

00 1 688 1 acid-amino acid ligase activity 



Description 

WNKs are highly conserved serine/threonine kinases that comprise a unique branch of the kinome. 
Members of this family are components of a signaling pathway that coordinates Cl~ influx and 
efflux to dynamically regulate intracellular Cl~ activity. 



Protein Name 

protein phosphatase 2c, putative 
EC: 3.1.3.16 



Life-cycle Phase 

Adult worms of mixed 
gender 



Localization 

Cytoplasm (Psort) 



GO annotation 

0000287 magnesium ion binding 

0003824 catalytic activity 

0004721 phosphoprotein phosphatase activity 

0004722 protein serine/threonine phosphatase activity 
0006470 protein dephosphorylation 

0008287 protein serine/threonine phosphatase complex 

0016787 hydrolase activity 

0030145 manganese ion binding 

0046872 metal ion binding 



Description 

Protein phosphatase 2C (PP2C) is one of the four major classes of mammalian serine/threonine 
speciflc protein phosphatases. Its exact physiological role is still unclear. Despite its evolutionary 

conservation, with isozymes in C. Elegans, Arabdopsis thaliana, Leishmania, among other 
organisms, PP2C does not seem to be evolutionary related to the main family of serine/threonine 
phosphatases. However, it is significantly similar to the catalytic subunit of pyruvate 
dehydrogenase phosphatase. Phosphatases remove a phosphate group fi-om its substrate by 
hydrolysing phosphoric acid monoesters into a phosphate ion and a molecule with a free hydroxyl 
group. Phosphatases act in opposition to kinase/phosphorylases, which add phosphate groups to 
proteins. The addition of a phosphate group may activate/de-activate an enzyme or enable a 
protein-protein interaction to occur. Therefore phosphatases are integral to many signal 
transduction pathways. 



Protein Name 

ral guanine nucleotide dissociation stimulator, 
ralgds, putative 



Life-cycle Phase 

Female worms 



Localization 

Cytoplasm, Nucleus 
(Psort) 



GO annotation 

0005085 guanyl-nucleotide exchange factor activity 
0005622 intracellular 

0007264 small GTPase mediated signal transduction 

005 1056 regulation of small GTPase mediated signal transduction 



Description 

Stimulates the dissociation of GDP from the Ras-related Ras-like protein A (RalA) and RalB 
GTPases which allows GTP binding and activation of the GTPases (UniProt Q 12967). RalA and 
RalB function as the critical GTP sensors for GTP-dependent exocytosis. 



Protein Name 

rabl5, 13, 10, 1, 35, 5, putative 



Life-cycle Phase 

Adult worms of mixed 
gender. Female worms 



Localization 

C)^oplasm (Psort) 



GO annotation 

0000 1 66 nucleotide binding 
0005488 binding 
0005525 GTP binding 

0007264 small GTPase mediated signal transduction 
0015031 protein transport 

Description 

This enzyme is a member of the small GTPase superfamily (Ras superfamily), which comprises 
monomeric GTP -binding proteins with molecular masses usually in the range of 20-25 kDa. The 
small GTPase superfamily has been found in all eukaryotic organisms. Small GTPases function as 
molecular switches in intracellular signaling to control a wide variety of cellular functions. In 
general the GDP-bound form is the inactive configuration of the molecular switch, while the GTP- 
bound form is active. The switch is activated by the exchange of GDP for GTP, catalyzed by 
guanine nucleotide exchange factors, in response to a variety of upstream signal signals. Activated 
GTPases then interact with one or more effector proteins, leading to activation of downstream 
signaling pathways. GTPase activating proteins stimulate an otherwise low intrinsic GTPase 
activity, reverting the conformation back to the inactive GDP-bound form. The process of 
activation and inactivation is repeated as long as the upstream stimulatory signal is present. 



Protein Name 

rho GDP-dissociation inhibitor-related 



Life-cycle Phase 

Male worms 



Localization 

Cytoplasm (Psort and 
GO) 



GO annotation 

0005094 Rho GDP-dissociation mhibitor activity 
0005737 cytoplasm 



Description 

Rho GDP-dissociation inhibitor (RhoGDI) plays an essential role in control of a variety of cellular 
functions through interactions with Rho family GTPases, including Racl, Cdc42, and RhoA. 
RhoGDI might play a role in the development of drug resistance in cancer cells. Overexpression of 
RhoGDl in cancer cells increased resistance to apoptosis induction, probably due to the 
inactivation of Racl degradation, since this protein is required for apoptosis in response to 
cytotoxic drugs. Taken together, the data show that RhoGDI is an antiapoptotic molecule that 
mediates cellular resistance to chemotherapy agents. 



Protein Name 



Life-cycle Phase 



Localization 



Serine/Threonine Kinase_CAMK 
Group_CAMKL Family AMPK Subfamily 
EC: 2.7.11.1 



Egg, Schistosomula, 
Female worms 



Cytoplasm, 
Mitochondria (Psort) 



GO annotation 

0000166 nucleotide binding 

0003824 catalytic activity 

0004672 protein kinase activity 

0004674 protein serine/threonine kinase activity 

0005524 ATP binding 

0006468 protein phosphorylation 

0008081 phosphoric diester hydrolase activity 

0016301 kinase activity 

0016310 phosphorylation 

0016772 transferase activity, transferring phosphorus-containing groups 

0046872 metal ion binding 

Description 

The role of AMPK in regulating cellular energy charge places this enzyme at a central control 
point in maintaining energy homeostasis. More recent evidence has shown that AMPK activity can 
also be regulated by physiological stimuli, independent of the energy charge of the cell, including 
hormones and nutrients. Once activated, AMPK-mediated phosphorylation events switch cells 
from active ATP consumption to active ATP production. 



Protein Name 

Serine/Threonine Kinase CMGC Group RCK 
Family MAK Subfamily 
EC: 2.7.11.1 



Life-cycle Phase 

Male worms 



Localization 

Cj^oplasm (Psort) 



GO annotation 

0000166 nucleotide binding 

000 1750 photoreceptor outer segment 

0003677 DNA binding 

0004672 protein kinase activity 

0004674 protein serine/threonine kinase activity 

0004693 cyclin-dependent protein kinase activity 

0005524 ATP binding 

0006366 transcription from RNA polymerase II promoter 

0006468 protein phosphorylation 

0007049 cell cycle 

0016301 kinase activity 

0016310 phosphorylation 

0016740 transferase activity 

0016772 transferase activity, transferring phosphorus-containing groups 

0045494 photoreceptor cell maintenance 

Description 

CMGC kinases include cyclin-dependent, MAP, glycogen synthase 3 and CK2-related kinases. 
MAK are serine/threonine protein kinases related to kinases involved in cell cycle regulation. 



Protein Name 

tbcl domain family member, putative 
GO annotation 

0005097 Rab GTPase activator activity 



Life-cycle Phase 

Female worms 



Localization 

Cytoplasm (Psort) 



0005622 intracellular 

00323 1 3 regulation of Rab GTPase activity 

003285 1 positive regulation of Rab GTPase activity 



Description 

This is a widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it 
performs a GTP-activator activity on Rab-like small GTPases. The mouse TBCl gene is presumed 
to have a role in regulating cell cycle control and differentiation. 



Protein Name 

glutaredoxin, grx, putative 
EC: 1.8.4.1 



Life-cycle Phase 

Adult worms of mixed 
gender, Female worms, 
Male worms 



Localization 

C)^oplasm (Psort) 



GO annotation 

0009055 electron carrier activity 

0015035 protein disulfide oxidoreductase activity 

0045454 cell redox homeostasis 

Description 

Glutaredoxins are electron carriers in the glutathione-dependent sjmthesis of deoxyribonucleotides. 
Moreover, it has an important role in the antioxidant defence. 



Protein Name 

heat shock protein 70 (hsp70)-interacting 
protein, putative 



Life-cycle Phase 

Egg 



Localization 

Cytoplasm (Psort) 



GO annotation 

0005488 binding 
0006950 response to stress 



Description 

CHIP (C-terminus of Hsp70-interacting protein) is a cochaperone and ubiquitin ligase that interacts 
with Hsp70 through an amino-terminal tetratricopeptide repeat (TPR) domain. Prominent among 
the cellular functions mediated by CHIP is protection against physiologic stress and inhibition of 
apoptosis. 



Protein Name 

thioredoxin m(mitochondrial)-type, putative 
EC: 1.11.1.15 



Life-cycle Phase 

Female worms, Male 
worms 



Localization 

Cytoplasm, 
Mitochondria (Psort) 



GO annotation 

0006662 glycerol ether metabolic process 
0009055 electron carrier activity 
0015035 protein disulfide oxidoreductase activity 
0045454 cell redox homeostasis 

Description 

Thioredoxin is a class of small redox proteins known to be present in all organisms. Although not 
entirely understood, thioredoxin plays a central role in humans as antioxidants, through their 
response to reactive oxygen species. It has also recently been found to play a role in cell-to-cell 
communication. Mitochondrial thioredoxin systems are critical in protecting cells from 
mitochondria-dependent ROS and apoptosis. 



Protein Name 

beta-parvin-related 


Life-cycle Phase 

Adult worms of mixed 
gender. Female worms 


Localization 

Cytoskeleton, 
Mitochondria (Psort) 


GO annotation 



0004252 serine-type endopeptidase activity 

0006508 proteolysis 



Description 

Probably plays a role in the regulation of cell adhesion and cytoskeleton organization (UniProt 
Q9HBI1). Beta-parvin, a focal adhesion and cytoskeleton associated protein, is a member of the 
parvin-multigene family. It consists of an a-actinin-like actin binding domain. Beta-parvin 
functions as a linker between important cytoskeletal proteins and is involved in many crucial 
cellular processes ranging from cell adhesion, spreading and proliferation, to deposition of 
extracellular matrix proteins. 



Protein Name 

cofilin, actophorin, putative 

GO annotation 

0003779 actin binding 
0005622 intracellular 



Life-cycle Phase 

Adult worms of mixed 
gender 



Localization 

Nucleus (Psort) 



Description 

Actophorin is a member of the ADF/cofilin family that binds F-actin. 



Protein Name 

Dynactin subunit 3 (Djmactin light chain p24), 
putative 



Life-cycle Phase 

All 



Localization 

Mitochondria (Psort) 



GO annotation 

null 



Description 

Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of 
cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and 
endosomes, spindle formation, cj^okinesis, chromosome movement, nuclear positioning, and 
axonogenesis. 



Protein Name 

nuclear movement protein nude, putative 



Life-cycle Phase 

Adult worms of mixed 
genders. Egg, Male 
worms 



Localization 

Cytoplasm (Psort) 



GO annotation 

null 



Life-cycle Phase 

Female worms 



Localization 

C)^oplasm (Psort) 



Description 

NudC is a nuclear movement protein that associates with dynein forming a biochemical complex 
with components of the dynein/dynactin complex. NudC plays multiple roles in mitosis and 
cytokinesis, in part by regulating microtubule organization. 

Protein Name 
tektin, putative 

GO annotation 

0000226 microtubule cj^oskeleton organization 
0005874 microtubule 

Description 

Tektins are cytoskeletal proteins found in cilia and flagella as structural components of outer 
doublet microtubules. They are also present in centrioles and basal bodies. They are polymeric in 
nature, and form filaments. The amino acid sequences of tektins are well conserved in nature, from 



Life-cycle Phase 

Female worms 



Localization 

Cytoskeleton (GO) 



C. elegans to Homo sapiens. 

Protein Name 
tropomodulin, putative 

GO annotation 

0005523 tropomyosin binding 
0005856 cytoskeleton 

Description 

Tropomodulin binds and caps the minus end of actin, regulating the length of actin filaments 

Protein Name 

tropomyosin, putative 

GO annotation 

0005523 tropomyosin binding 
0005856 cytoskeleton 



Life-cycle Phase 

Female worms 



Localization 

Cytoplasm (Psort) 



Description 

Tropomyosin is an actin-binding protein that regulates actin mechanics. 



Protein Name 

Enolase (2-phosphoglycerate dehydratase) (2- 
phospho-D-glycerate hydro-lyase), putative 



Life-cycle Phase 

Adult worms of mixed 
gender, Egg, Female 
worms, Male worms 



Localization 

Cytoplasm (GO and 
Psort) 



GO annotation 

0000015 phosphop)nTivate hydratase complex 

0000287 magnesium ion binding 

0004634 phosphopyruvate hydratase activity 

0005737 cytoplasm 

0006096 glycolysis 

0016829 lyase activity 

0046872 metal ion binding 

Description 

In the glycolysis, it catalyses the conversion of phosphoenolpyruvate in to 



Protein Name 

Glyceraldehyde-3 -phosphate dehydrogenase 
(GAPDH) (Major larval surface antigen) (P-37), 
putative 
EC: 1.2.1.12 



Life-cycle Phase 

Egg 



2-phosphoglycerate. 

Localization 

Cytoplasm (Psort) 



GO annotation 

0000166 nucleotide binding 

0004365 glyceraldehyde-3 -phosphate dehydrogenase (NAD+) (phosphorylating) activity 

0005737 cytoplasm 

0006006 glucose metabolic process 

0006096 glycolysis 

0016491 oxidoreductase activity 

0050661 NADP binding 

0051287 NAD binding 

0055 1 14 oxidation-reduction process 

0016620 oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or 
NADP as acceptor 

Description 



This enzyme catalyzes the sixth step of glycolysis, the breakdown of glucose yielding energy and 
carbon molecules. GAPDH has recently been implicated in several non-metabolic processes, 
including transcription activation, initiation of apoptosis and ER-Golgi vesicle shuttling. 



Protein Name 

1-lactate dehydrogenase, putative 
EC: 1.1.1.27 



Life-cycle Phase 

Egg 



Localization 

Cytoplasm (Psort) 



GO annotation 

0000166 nucleotide binding 
catalytic activity 
L-lactate dehydrogenase activity 
cytoplasm 

carbohydrate metabolic process 
glycolysis 

oxidoreductase activity 
cellular carbohydrate metabolic process 
oxidation-reduction process 

oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as 



0003824 
0004459 
0005737 
0005975 
0006096 
0016491 
0044262 
0055114 
0016616 
acceptor 

Lactate dehydrogenase is a highly conserved enzyme that catalyzes the interconversion of pyruvate 
and lactate with concomitant interconversion of NADH and NAD+. It converts pyruvate, the final 
product of glycolysis, to lactate when oxygen is absent or in short supply. 



Protein Name 

ribulose-5-phosphate-3-epimerase, putative 
EC: 5.1.3.1 



Life-cycle Phase 

Female worm 



Localization 

Cytoplasm (Psort) 



GO annotation 

0003824 
0004750 
0005975 
0008152 
0016853 
0046872 



catalytic activity 

ribulose-phosphate 3-epimerase activity 
carbohydrate metabolic process 
metabolic process 
isomerase activity 
metal ion binding 



Description 

This is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in the 
pentose phosphate pathway which generates NADPH and pentoses. The primary outcomes of this 
pathway are NADPH used in reductive biosynthetic reactions and building blocks for nucleotides, 
amino acids and nucleic acids. 



Protein Name 

1 -acylglycerol-3-phosphate o-acyltransferase, 

putative 

EC: 2.3.1.51 



Life-cycle Phase 

Male worms 



Localization 

Membrane (GO) 



GO annotation 



0003841 
0008152 
0008654 
0016020 
0016740 
0016746 

Description 



l-acylglycerol-3 -phosphate O-acyltransferase activity 

metabolic process 

phospholipid biosynthetic process 

membrane 

transferase activity 

transferase activity, transferring acyl groups 



This transferase participates in three metabohc pathways: glycerolipid metabolism, 
glycerophosphoUpid metabolism and ether lipid metabolism. The animal enzyme is specific for the 

transfer of unsaturated fatty acyl groups. 

(http ://www. llamp.net/ sites/default/files/pictures/recycling_of_phospholipids . svg) . 



Protein Name 

acetyl-CoA C-acyltransferase, putative 
EC: 2.3.1.16 



Life-cycle Phase 

Male worms 



Localization 

Mitochondria or 
Peroxisome, Membrane 
(Psort) 



GO annotation 

0003824 catalytic activity 

0003985 acetyl-CoA C-acetyltransferase activity 

0003988 acetyl-CoA C-acyltransferase activity 

0008152 metabolic process 

0008270 zinc ion binding 

0016740 transferase activity 

0016746 transferase activity, transferring acyl groups 

0016747 transferase activity, transferring acyl groups other than amino-acyl groups 
0046872 metal ion binding 

Description 

This enzyme catalyzes the final step of the beta oxidation, which is the cleavage of 3-ketoacyl 
CoA. Beta-oxidation is the process by which fatty acids are broken down in the mitochondria 
and/or peroxisomes to generate acetyl-CoA. Acetyl-CoA is the entry molecule for the Citric Acid 
Cycle, which is used by all aerobic organisms to generate energy fi-om carbohydrates, fats and 
proteins. In addition, the cycle provides biosynthetic precursors for some amino acids and NADH, 
which is used in numerous biochemical reactions. 



Protein Name 

acyl-coenzyme A binding domain containing, 
putative 



Life-cycle Phase 

Female worms 



Localization 

Cytoskeleton (Psort) 



GO annotation 

0000062 fatty-acyl-CoA binding 
0005488 bindmg 

Description 

Acyl-coenzyme A binding proteins (ACBP) bind thiol esters of long fatty acids and coenzyme A in 
a one-to-one binding mode with high specificity and affinity. Acyl-CoAs are important 
intermediates in fatty lipid s5Tithesis and fatty acid degradation and play a role in regulation of 
intermediary metabolism and gene regulation. The suggested role of ACBP is to act as a 
intracellular acyl-CoA transporter and pool former. ACBPs are present in a large group of 
eukaryotic species and several tissue-specific isoforms have been detected. 

This enzyme is closely related to ACBD6 proteins, which preferentially bind unsaturated long- 
chain acyl-CoAs (NCBI CDD). 



Protein Name 

arvl, putative 



Life-cycle Phase 

Female worms 



Localization 

Membrane (Psort) 



GO annotation 

0006355 regulation of transcription, DNA-dependent 
0008270 zinc ion binding 



Description 

Arvl is a transmembrane protein, with potential zinc-binding motifs, that mediates sterol (a class 
of lipids) homeostasis, preventing free sterol toxicity. In yeast, Arvlp is important for the delivery 



of an early GPI (glycosylphosphatidylinositol) intermediate to the first mannosyltransferase of GPI 
synthesis in the ER lumen. It is important for the traffic of sterol in yeast and in humans. In 
eukaryotic cells, it may fuction in the sphingolipid metabolic pathway as a transporter of ceramides 
between the ER and Golgi (InterPro IPR007290). Sterol homeostasis in eukaryotic cells relies on 
the reciprocal interconversion of free sterols and steryl esters, which constitute an important 
storage form for fatty acids and sterols that are deposited in intracellular lipid particles (Eukaryotic 
sterol homeostasis: Steryl ester hydrolases in Saccharomyces cerevisiae). 



Protein Name 

serine palmitoyltransferase I, putative 
EC: 2.3.1.50 



Life-cycle Phase 

Egg 



Localization 

Cytoplasm, Peroxisome 
(Psort) 



GO annotation 

0003824 catalytic activity 
0009058 biosynthetic process 

0016740 transferase activity 

0016769 transferase activity, transferring nitrogenous groups 
0030 1 70 pyridoxal phosphate binding 



Description 

This enzjane catalyzes a key step in the biosjTithesis of sphingosine which is a precursos of many 
other sphingolipids. Ceramides, sphingomyelins, glycosphingolipids and gangliosides are the main 
types. Sphingolipids are commonly believed to protect the cell surface against harmful 
environmental factors by forming a mechanically stable and chemically resistant outer leaflet of 
the lipid bilayer. Certain complex glycosphingolipids were found to be involved in specific 
functions, such as signal transmission and cell recognition. Cell recognition depends mainly on the 
physical properties of the sphingolipids, whereas signaling involves specific interactions of the 
glycan structures of glycosphingolipids with similar lipids present on neighboring cells or with 
proteins. Recently, relatively simple sphingolipid metabolites have been shown to be important 
mediators in the signaling cascades involved in apoptosis, proliferation and stress. Ceramide-based 
lipids self-aggregate in cell membranes and form microdomains known as lipid rafts, which were 
proposed to sort membrane proteins along the cellular pathways of membrane transport. 
Sphingolipids synthesis begins in the ER and is completed in the Golgi. 



Protein Name 

3 -dehydroecdy sone 3 alpha-reductase-related 



Life-cycle Phase 

Adult worms of mixed 
gender, Female worms, 
Male worms 



Localization 

Membrane (Psort) 



GO annotation 

0000166 nucleotide binding 

0004316 3-oxoacyl-[acyl-carrier-protein] reductase (NADPH) activity 
0008152 metabolic process 
0016491 oxidoreductase activity 
0055 114 oxidation-reduction process 

Description 

One route of ecdysteroids inactivation in insects involves the conversion of ecdysone into 3- 
dehydroecdysone, followed by its reduction to 3-epicdysone, catalyzed by 3-dehydroecdysone 3 
alpha reductase. This enzyme is a novel eukaryotic member of the dehydrogenase/reductases 
superfamily. Ecdysteroids have been detected in trematode parasites and in several species of 
gastropod snails. There are potential roles of these and other hormones as regulators of 
host/parasite interactions between Schistosoma mansoni and Biomphalaria glabrata. Beta- 
ecdysterone was found to be effective in stimulating host location activities in S. mansoni 
miracidia and in enhancing growth and egg production in B. glabrata. However, plant extracts rich 



in ph5^oecdysteroids were not attractive to miracidia and did not affect growth and egg production 
in snails. The potential role of ecdysteroids in the process of parasitism and snail physiological 
processes is discussed. Plants rich in phytoecdysteroids do not appear to be of value in process 
which may interfere with the host parasite interactions in schistosomes (Regulation of ecdysteroid 
signalling: molecular cloning, characterization and expression of 3-dehydroecdysone 3 alpha- 
reductase, a novel eukaryotic member of the short-chain dehydrogenases/reductases superfamily 
from the cotton leafworm, Spodoptera littoralis and Ecdysteroids as regulators of host and parasite 
interactions: a study of interrelationships between Schistosoma mansoni and the host snail, 
Biomphalaria glabrata). 



Protein Name 

isopentenyl-diphosphate delta isomerase, 

putative 

EC: 5.3.3.2 



Life-cycle Phase 

Adult worms of mixed 
gender. Female worms 



Localization 

Cytoplasm, Nucleus 
(Psort and GO) 



GO annotation 

0000287 magnesium ion binding 

0004452 isopentenyl-diphosphate delta-isomerase activity 

0004743 pjTuvate kinase activity 

0005634 nucleus 

0006096 glycolysis 

0006352 transcription initiation, DNA-dependent 

0008299 isoprenoid biosynthetic process 

0016787 hydrolase activity 

0016853 isomerase activity 

0030955 potassium ion binding 



Description 

This enzyme catalyzes the conversion of the relatively un-reactive isopentenyl pyrophosphate 
(IPP) to the more-reactive electrophile dimethylallyl pjn-ophosphate. This reaction is a key step in 
the biosynthesis of isoprenoids through the highly conserved mevalonate pathway. Isoprenoids 
play widely varying roles in the physiological processes of plants and animals. Their function 
includes pigments, fragances, vitamins and precursos of sex hormones. 



Life-cycle Phase 

Egg 



Localization 

Membrane (Psort) 



Protein Name 

sterol reductase-related 

EC: 1.3.1.71 

GO annotation 

0004872 receptor activity 
0016020 membrane 

Description 

This oxidoreductase acts on the CH-CH group of donor with NAD+ or NADP+ as acceptor. This 
enzyme participates in the biosjnithesis of steroids such as cholesterol, estradiol and testpsterone. 



Protein Name 

atpase inhibitor, putative 



Life-cycle Phase 

All 



Localization 

Mitochondria (GO) / 
Cj^oplasm (Psort) 



GO annotation 

0004857 enzyme inhibitor activity 
0005739 mitochondrion 

0043086 negative regulation of catalj^ic activity 

0045980 negative regulation of nucleotide metabolic process 

Description 



It is thought to inhibit ATP synthesis by preventing the release of ATP. The ATP s)aithase inhibitor 
prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical 
gradient, for example during oxygen deprivation. The inhibitor has two oligomeric states, dimer 
(the active state) and tetramer. At low pH, the inhibitor forms a dimer via antiparallel coiled coil 
interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms 
tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory 
region, thus preventing the inhibitory activity (InterPro IPR007648). 



Protein Name 

ATP synthase delta chain, mitochondrial, 

putative 

EC: 3.6.3.14 



Life-cycle Phase 

Egg 



Localization 

Mitochondria 



GO annotation 

00 15986 ATP sjoithesis coupled proton transport 

0045261 pro ton- transporting ATP synthase complex, catalytic core F(l) 
0046933 hydrogen ion transporting ATP synthase activity, rotational mechanism 
0046961 proton-transporting ATPase activity, rotational mechanism 



Description 

Mitochondrial ATP sjoithase catalyzes ATP s)nithesis, utilizing an electrochemical gradient of 
protons across the irmer membrane during oxidative phosphorylation. 



Protein Name 

cytochrome C oxidase copper chaperone, 
putative 

GO annotation 

0004129 cytochrome-c oxidase activity 



Life-cycle Phase 

All 



Localization 

Mitochondria, 
Extracellular (Psort) 



Description 

This metallochaperone is essential for the assembly of functional cytochrome c oxidase (CCO) and 
for delivery of copper ions to the mitochondrion for insertion into the enzjmie (InterPro 
IPR007745). Cytochrome c oxidase is a large transmembrane protein complex found in bacteria 
and, in eukaryotic mithocondria that plays a fundamental role in the electron transport chain. 
Therefore, the metallochaperone is involved with the generation of a proton gradient that 
culminates with ATP formation. 



Protein Name 

ectonucleotide 

pyrophosphatase/phosphodiesterase, putative 
EC: 3.1.4 



Life-cycle Phase 

Adult worms of mixed 
gender 



Localization 

Membrane (Psort) 



GO annotation 

0003824 catalytic activity 
0008152 metabolic process 

Description 

Involved primarily in ATP hydrolysis at the plasma membrane. Plays a role in regulating 
pyrophosphate levels, and functions in bone mineralization and soft tissue calcification (Uniprot 
P22413). 



Protein Name 

hexaprenyldihydroxybenzoate 
methyltransferase, putative 
EC: 2.1.1.114 



Life-cycle Phase 

Male worms 



Localization 

Mitochondria (Psort 



GO annotation 



0006744 ubiquinone biosynthetic process 

0008152 metabolic process 

0008168 methyltransferase activity 

0008425 2-polyprenyl-6-methoxy-l,4-benzoquinone methyltransferase activity 

0016740 transferase activity 

0032259 methylation 



Description 

This transferase participates in the biosynthesis of ubiquinone (coenzyme QIO). This oil-soluble, 
vitamin-like substance is present in most eukaryotic cells, primarily in the mithocondria. It is a 
component of the electron transport chain and participates in aerobic cellular respiration, 
generating energy in the form of ATP. 



Protein Name 

ubiquinone biosjoithesis protein, putative 



Life-cycle Phase 

All 



Localization 

C)^oplasm, 
Extracellular (Psort) / 
Mitochondria (GO) 



GO annotation 

0005739 mitochondrion 
0005743 mitochondrial inner membrane 
0006744 ubiquinone biosynthetic process 
0016020 membrane 

Description 

This protein is required for the biosynthesis of ubiquinone (InterPro IPR012762). This oil-soluble, 
vitamin-like substance is present in most eukaryotic cells, primarily in the mithocondria. It is a 
component of the electron transport chain and participates in aerobic cellular respiration, 
generating energy in the form of ATP. 



Protein Name 

ubiquinol-cytochrome C reductase complex ubiquinol 
binding protein (UbCRBP) 



Life-cycle Phase 

Adult worms of mixed 
gender. Egg, Female 
worms, Male worms 



Localization 

Mitochondria 



GO annotation 
null 

Description 

This protein is involved in redox-linked proton pumping as a constituent of the mitochondrial 
respiratory chain in the inner mitochondrial membrane. 



Protein Name 

molybdopterin-binding, putative 
EC: 2.7.7.2 



Life-cycle Phase 

Egg 



Localization 

Cytoplasm (Psort) 



GO annotation 

0003824 catalytic activity 

0003919 FMN adeny lyltransferase activity 

0006747 FAD biosynthetic process 

0006777 Mo-molybdopterin cofactor biosynthetic process 

0008152 metabolic process 

0016740 transferase activity 

0016779 nucleotidyltransferase activity 

Description 

Eukaryotic and prokaryotic molybdoenzymes require a molybdopterin cofactor (MoCF) for their 
activity. The biosynthesis of this cofactor involves a complex multistep enzymatic pathway. One of 



the eukaryotic proteins involved in this pathway is highly similar to a microtubule-associated 
protein (InterPro IPR001453). Molybdenum is essential for most biological systems as it is 
required by enzymes catalyzing reactions in carbon, sulfur and nitrogen metabolism. 
(www.imoa.info/HSE/environmental_data/biology/reviews_of_molybdoenzymes.php). 



Protein Name 

uroporphyrinogen-III synthase, putative 
EC: 4.2.1.75 



Life-cycle Phase 



Localization 

Cytoplasm (GO) / 
Extracellular (Psort) 



GO annotation 

0004852 uroporphyrinogen-III synthase activity 

0016829 lyase activity 

0033014 tetrapyrrole biosynthetic process 



Description 

Uroporphyrinogen III synthase is an enzyme involved in the metabolism of porphyrin. This 
enzyme catalyses the conversion of hydroxjmiethyl bilane into uroporphjTinogen III through the 
inversion of the final pyrrole unit (ring D) of the linear tetrapyrrole molecule, linking it to the first 
pyrrole unit (ring A), thereby generating a large macrocyclic structure, uroporphyrinogen III. 
Porphyrins are a class of organic compounds including heme. Earlier studies from the Padmanaban 
group had shown that the malarial parasite sjmthesizes heme de novo, despite acquiring heme from 
the host red cell hemoglobin in the intraer5^hrocytic stage. In Plasmodium falciparum, the 
uroporphyrinogen-III synthase is not present and its activity is probably supplied by 
porphobilinogen deaminase, which may have a dual function (porphobilinogen — >■ hydroxymethil 
bilane — > uroporphyrinogen III) (Unique Properties of Plasmodium falciparum Porphobilinogen 
Deaminase). 



Protein Name 

4-hydroxyphenylp)Tuvate dioxygenase, putative 
EC: 1.13.11.27 



Life-cycle Phase 

Female worms 



Localization 

Cj^oplasm (Psort) 



GO annotation 

0003868 
0009072 
0055114 

0016701 
0016702 



4-hydroxyphenylpyruvate dioxygenase activity 
aromatic amino acid family metabolic process 
oxidation-reduction process 

oxidoreductase activity, acting on single donors with incorporation of molecular oxygen 
oxidoreductase activity, acting on single donors with incorporation of molecular 



oxygen, incorporation of two atoms of oxygen 



Description 

This Fe-containing enzyme catalyses the second reaction in the catabolism of tyrosine. 



Protein Name 

valacyclovir hydrolase, putative 



Life-cycle Phase Localization 

Adult worms of mixed Mitochondria (Psort) 
gender 



GO annotation 

null 



Description 

Serine hydrolase that catalyzes the hydrolytic activation of amino acid ester prodrugs of nucleoside 
analogs such as valacyclovir and valganciclovir. Activates valacyclovir to acyclovir. May play a 
role in detoxification processes. It is a specific alpha-amino acid ester hydrolase that prefers small, 
hydrophobic, and aromatic side chains and does not have a stringent requirement for the leaving 
group other than preferring a primary alcohol (Uniprot Q86WA6). 



Protein Name 

fatty acid binding protein, putative 



Life-cycle Phase Localization 

Adult worms of mixed Cytoplasm (Psort and 





gender 


GO) 


GO annotation 






0005215 transporter activity 






0005488 binding 






0005737 cytoplasm 






0006810 transport 






0008289 lipid binding 






Description 






The fatty-acid-binding proteins are a family of carrier proteins for fatty acids and other lipophilic 


substances such as eicosanoids and retinoids. These proteins are thought to facilitate the transfer of 


fatty acids between extra- and intracellular membranes. Some family members are also believed to 


transport lipophilic molecules from outer cell membrane to certain intracellular receptors. Levels 


of fatty-acid-binding protein have been shown to decline with ageing in the mouse brain, possibly 


contributing to age-associated decline in sjaiaptic. 




Protein Name 


Life-cycle Phase 


Localization 


mitochondrial carrier protein-related 


Egg 


Mitochondria, 






Cytoplasm (Psort and 






GO) 


GO annotation 






0005488 binding 






0005743 mitochondrial inner membrane 






0006810 transport 






0016020 membrane 






00 1 602 1 integral to membrane 






0055085 transmembrane transport 






Description 






Mitochondrial carriers are proteins from the solute carrier family which transfers molecules across 


the membranes of the mithocondria. These proteins are usually involved in energy transfer. 


Common substrates include: citrate, ornithine, phosphate, glutamate, ADP and ATP. 


Protein Name 


Life-cycle Phase 


Localization 


nuclear transport factor, putative 


Egg 


Nucleus, Extracellular 






(Psort) 


GO annotation 






0005622 intracellular 






0006810 transport 






Description 






These proteins are involved in the transport of other proteins to the cellular nucleus. The transport 


of macromolecules between the nucleus and the cytoplasm is crucial for controlling cellular 


processes such as gene expression, cell division, and signal transduction. (Mapping Interactions 


between Nuclear Transport Factors in Living Cells Reveals Pathways through the Nuclear Pore 


Complex) 






Protein Name 


Life-cycle Phase 


Localization 


peripheral-type benzodiazepine receptor. 


Adult worms of mixed 


Extracellular (Psort) / 


putative 


gender. Egg, Female 


Membrane (GO) 




worms 




GO annotation 






0004872 receptor activity 






0016021 integral to membrane 







Description 

The peripheral-type benzodiazepine receptor or recognition site (PBR) is a widely distributed 
protein, located mainly in the outer mitochondrial membrane. The protein binds to high-affmity 
drug ligands and cholesterol. Many functions are associated directly or indirectly with the protein, 
including the regulation of cholesterol transport and the synthesis of steroid hormones, porphyrin 
transport and heme synthesis, apoptosis, cell proliferation, anion transport, regulation of 
mitochondrial functions and immunomodulation. 



Life-cycle Phase 

Male worms 



Localization 

Cytoplasm (Psort) / 
Membrane (GO) 



Protein Name 

solute carrier family 35 member CI, putative 

GO annotation 

0016020 membrane 

00 1 602 1 integral to membrane 

Description 

In humans, this gene encodes a GDP-fucose transporter that is found in the Golgi apparatus. 
Mutations in this gene result in congenital disorder of glycosylation type lie. 



Protein Name 

B-cell receptor-associated protein-like protein 



Life-cycle Phase 

Female worms 



Localization 

ER-Golgi, Extracellular 
(Psort) / ER, 
Membrane (GO) 



GO annotation 

0004872 receptor activity 
0005783 endoplasmic reticulum 
0006886 intracellular protein transport 
00 1 602 1 integral to membrane 



Description 

Its human homologous is believed to have a role in the anterograde transport of membrane proteins 

from the endoplasmic reticulum to the Golgi. It may also be involved in CASPS-mediated 
apoptosis, since it cleaved by CASP8 and other caspases (UniProt P51572). 



Protein Name 

dynamin-associated protein, putative 



Life-cycle Phase 

All 



Localization 

Cytoplasm (Psort) / 
Membrane (GO) 



GO annotation 

0004872 receptor activity 

0005085 guanyl-nucleotide exchange factor activity 

0005089 Rho guanyl-nucleotide exchange factor activity 

0005509 calcium ion binding 

0005515 protein binding 

0005622 intracellular 

0006897 endocytosis 

0007264 small GTPase mediated signal transduction 

0012505 endomembrane system 

0016020 membrane 

0016021 integral to membrane 
0019209 kinase activator activity 
0019717 synaptosome 
0030027 lamellipodium 
0030054 cell junction 
0030139 endocy tic vesicle 



0035023 regulation of Rho protein signal transduction 

0035556 intracellular signal transduction 

0042327 positive regulation of phosphorylation 

0042995 cell projection 

0043524 negative regulation of neuron apoptosis 

0045202 synapse 

005 1 897 positive regulation of protein kinase B signaling cascade 



Description 

This protein is associated to D5aiamin, which is an eukaryotic GTPase responsible for endoc5^osis. 



Protein Name 

golgi snare bet 1 -related 



Life-cycle Phase 

Female worms 



Localization 

Plasma membrane 
(Psort) 



GO annotation 

null 



Description 

The Betl protein is involved in the vesicular transport between the ER and the Golgi complex. 



Protein Name 

rabb and c, putative 



Life-cycle Phase 

Schistosomula 



Localization 

Cj^oplasm (Psort) 



GO annotation 

0000166 nucleotide binding 
0005525 OTP binding 

0007264 small GTPase mediated signal transduction 
0015031 protein transport 

Description 

The Rab family is a member of the Ras superfamily of monomeric G proteins. Rab GTPases 
regulate many steps of membrane traffic, including vesicle formation, vesicle movement along 
actin and tubulin networks, and membrane fusion. These processes make up the route through 
which cell surface proteins are trafficked from the Golgi to the plasma membrane and are recycled. 
Surface protein recycling returns proteins to the surface whose function involves carrying another 
protein or substance inside the cell, such as the transferrin receptor, or serves as a means of 
regulating the number of a certain type of protein molecules on the surface. 



Protein Name 

Small VCP/p97-interacting protein, putative 


Life-cycle Phase 

Egg, Schistosomula, 
Female worm, Male 
worm 


Localization 

Cytoplasm (Psort) 


GO annotation 

null 



Description 

VCP/p97 is involved in a variety of cellular processes, including membrane fusion and ubiquitin- 
dependent protein degradation. Small VCP/p97-interacting protein is a novelVCP/p97 adaptor 
whose function is related to the integrity of the endoplasmic reticulum (SVIP Is a Novel VCP/p97- 
interacting Protein Whose Expression Causes Cell Vacuolation). Overexpression causes the 
formation of large vacuoles that seemed to be derived from the endoplasmic reticulum. 



Protein Name 

vacuolar ATP synthase proteolipid subunit 1, 2, 
3„ putative 


Life-cycle Phase 

Egg 


Localization 

Membrane (GO) / 
Membrane (Psort) 


GO annotation 



0006810 transport 

0006811 ion transport 

0015078 hydrogen ion transmembrane transporter activity 

00 1 599 1 ATP hydrolysis coupled proton transport 

0015992 proton transport 

0016020 membrane 

0016021 integral to membrane 

0033 1 77 pro ton- transporting two-sector ATPase complex, proton- transporting domain 

0033 1 79 proton- transporting V-type ATPase, VO domain 

Description 

Vacuolar-type ATPases (V-ATPases) are ubiquitous proton pumps that occur in the endomembrane 
system of all eukaryotic cells and in plasma membranes of many animal cells. They have various 
functions including the energization of transport processes across membranes and the regulation of 
the intracellular or intraorganellar pH (Beyenbach and Wieczorek, 2006; Forgac, 2007). V- ATPases 
are heteromultimeric enzymes consisting of a cytosolically oriented catalytic Vi complex, and a 
membrane bound proton translocating Vo complex (Inhibitors of V-ATPases: old and new players). 



Protein Name 

amyloid beta A4 protein related 

GO annotation 

null 



Life-cycle Phase 

Egg 



Localization 

Cj^oplasmic (Psort) 



Description 

Beta amyloid proteins are potentially involved with several physiological processes within the cell, 
such as: the activation of kinases, protection against oxidative stress, regulation of cholesterol 
transport, transcription factor and anti-microbial activity. 



Protein Name 
autophagy protein 16-like 



Life-cycle Phase 

Egg, Schistosomula, 
Female worm, Male 



Localization 

Cytoplasm (Psort) 



worm 



GO annotation 

null 



Description 

This protein is part of a large complex that plays a role in autophagy, the major process by which 
intracellular components are targeted to lysosomes for degradation. Several transcript variants 
encoding different isoforms have been found for this gene. 



Protein Name 

metal dependent hydrolase - related 



Life-cycle Phase Localization 

Adult worms of mixed Cj^oskeleton (Psort) 
gender 



GO annotation 

0016787 hydrolase activity 
Description 

This is a protein with hydrolase activity, that acts on the catalysis of chemical bond hydrolysis. 



Protein Name 

Protein C14orfl53 precursor, putative 

GO annotation 

null 

Description 



Life-cycle Phase 

Female worms 



Localization 

Cytoplasm (Psort) 



This proteins contains an uncharacterized conserved domain (DUF2315); This is a family of small 

conserved proteins found from worms to humans. The function is not known (Pfam). 



Protein Name 

Sj-Tsl , putative 



Life-cycle Phase 



Localization 

C)^oplasm (Psort) 



GO annotation 

null 

Description 

Sj-Tsl is a predicted Schistosoma japonicum protein that contains one transmembrance helix, one 
N-myristoylation site, two phosphorylation sites for protein kinase C and one for tyrosine kinase. 



Protein Name 
tetraspanin D76 , putative 



Life-cycle Phase 

Female worms 



Localization 

Plasma membrane 
(Psort and GO) 



GO annotation 

0016020 membrane 

00 1 602 1 integral to membrane 

Description 

This protein is involved in mechanisms that act during hemocyte phagocytosis, controlling the 
abrupt transition of hemocj^es from resting, nonadherent cells to activated, adherent cells during 
these cell-mediated responses. 



Protein Name 

WD repeat protein SLl-17, putative 

GO annotation 

null 



Life-cycle Phase 

Female worms 



Localization 

Cytoplasm (Psort) 



Description 

WD-repeat proteins are involved in a variety of cellular functions, from signal transduction and 
transcription regulation to cell cycle control, autophagy and apoptosis. One common function of 
all WD-repeat proteins seems to be the coordination of multi-protein complex assemblies, where 
the repeating units serve as a rigid scaffold for protein interactions. 



